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concept  that  ability  to  lean*  a  miniature  and  representative  aspect  of  a  job  can  serve 
as  a  predictor  of  ability  to  learn  the  job  as  a  journeyman ,  The  Latter?  was  admin¬ 
istered  to  50  black  sad  49  white  recruits  who  were  below  the  minimal  acceptable  score 
for  admission  to  the  machinist  mate  school  training,  as  measured  by  the  usual  Navy 
written  tests.  These  recruits  were  placed  on  the  job  and  their  leyei  of  coapetance 
was  measured  through  wrvrk  sample  performance  test  ssethods  nine  souths  later.  It  was 
possible  to  acquire  certain  criterion  data  fov  29  of  the  black  sad  25  of  the  white 
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performance  criterion  than  the  usual  Navy  tests.  In  a  considerable  number  of  cases, 
the  "low  aptitude"  sample  who  had  surpassed  the  minimal  Navy  test  scores  and  who  bad 
entered  the  specialty  after  attending  the  Navy  school  for  machinist  mates.. 
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ABSTRACT 


% 

The  initial  validation  of  a  nonverbal,  culture -r air  Littery  of  tests  for  pre¬ 
dicting  performance  of  Kavy  machinist  mates  is  described.  The  battery  is  based  on 
the  concept  that  ability  to  leara  a  aiaiature  and  representative  aspect  of  a  job 
can  serve  as  a  predictor  of  ability  to  learn  the  job  as  a- journeyman.  The  battery 
was  administered  to  59  black  and  49  white  recruits  who  were  below  the  ninical  accept¬ 
able  score  for  admission  to  the  machinist  mate  school  training,  as  measured  by  the 
usua 1  Navy  written  tests.  These  recruits  were  placed  on  the  job  ana  their  level  of 
competence  was  measured  through  work  sample  performance  test  methods  nine  months  later. 
It  was  possible  to  acquire  criterion  data  tor  29  of  the  r-lack  and  25  of  the  white  sub¬ 
jects  *'  The  results  indicated  the  performance  battery  to  correlate  higher  with  the 
performance  criterion  than  the  usual  Navy  tests.  In  a  considerable  iranbar  of  cases, 
the  "low  aptitude**  sample  performed  better  on  the  criterion  tests  than  persons  in  a 
control  Sample  who  had  surpassed  the  sinusal  acceptable  Bavy  test  score s  and  who  had 
entered  the  specialty  after  attending  the  Kavy  school  for  machinist  mutes. 
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CHAPTER  I 


INTRODUCTION  AND  PURPOSE 


The  present  report  is  the  second  of  a  series  in  the  area  of  "nonverbal  and 
culture  fair  per  romance  prediction  procedures.”  The  first  report  in  the  series 
(Siegel  t-  Bergjsao,  1972)  presented  the  logic  of  the  present  effort,  the  tests  develop¬ 
ment,  and  initial  findings.  The  present  research  program  is  based  on  a  concept  re¬ 
lated  to  test  "culture  fairness”  or  "culture  freeness,"  but  is  sore  properly  associated 
with  a  "culture  loaded"  descriptor.  By  culture  Toaded«  we  mean  performance  prediction 
on  the  basis  of  instruments  that  have  been  loaded  in  the  job  culture  of  interest . 

The  contention  is  made  that  a  person’s  ability  to  learn  a  job  sample,  can  be 
used  to  predict  his  ability  to  learn  and  perform  :he  total  job.  Accordingly,  the 
demonstrated  ability  to  learn  selected  job  aspects  is  employed  as  a  predictor  of 
ability  to  learn  to  perform  the  total  job.  The  job  sample  tests  (mini  tests)  invoj.ve 
no  written  materials  and  are  relatively  free  from  the  usual  confounding  effects  and 
influences  of  academic  learning. 

Tha  specific  purpose  of  the  present  effort  is  to  investigate.  In  the  Navy  content 
assessment  ®ethcds,  techniques,  and  procedures  which  are  free  frees  the  biases  ascribed 
to  sore  conventional  testing  methods.  This  study  does  not  focus  oi>  testing  approaches 
t  hich  possess  equal  predictive  validity  for  both  high  and  low  aptitude  groups.  Such 
a  development  is  believed  to  be  desirable,  but  not  neees  ;ary.  Alternatively,  the 
assumption  is  made  that  normal  *<iavy  testing  procedures  (?JC?  ♦  ARI  +  SrCH)  are  suffi¬ 
cient  for  persons  possessing  high  aptitude  as  measured  by  these  methods.  Persons 
who  achieve  high  scores  on  these  tests  will  not  be  affected  to  the  extent  chat  their 
progress  in  the  military  will  be  debilitated  in  any  way.  Alternatively,  the  Kavy 
career  of  persons  who  scare  poorly  on  these  usual  Navy  tests  may  be  unduly  affected 
ey  the  cultural  factors  discussed  in  the  previous  and  present  r<d^>rt. 

The  specific  research  steps  urchin  tne  total  investigation  include; 

1.  Development  of  a  sample  of  miniaturized  job  learning 
situations  (tests)  relevant  co  the  machinist  sate  IHH) 
rating  in  the  Havy.  These  miniaturized  job  learning 
situations  are  called  training  and  evaluative  situations 
in  subsequent  portions  of  this  report. 

2-  Adm inis  era t ion  of  these  tests  to  a  sample  of  low  aptitude 
black  and  white  recruits  and  assigning  these  recruits  to 
predicted  successful  and  predicted  unsuccessful  groups  on 
the  basis  of  their  test:  scores. 

3.  Assignment  of  ail  persons  siaspled  to  Fleet  jobs  in  the  machinist 
sate  rating. 

4.  Followup  after  the  jtrjup  has  nine  .months  of  Fleet  experience,  to 
determine  the  degree  of  on-the-job  success  experienced . 


5.  followup, after  the  group  has  18  months  of  fleet  experience ,  to 
determine  the  degree  of  on-the-job  success  experienced. 


To  date,  the  first  four  of  these  .rteps  have  been  completed .  The  r-eaalrilng 
sections  of  Chapter  I  of  this  report  include  an  extension  of  the  literature  review 
presented  in  the  initial  report  of  this  series.  This  literature  review  contains 
discussions  of  the  following  relevant  topics:  (a)  test  and  measurement  error  for 
var ‘ ous  socioeconomic  and  racial  groups,  (b)  personality,  achievement ,  and  self- 
concept  factors  affecting  test  scores,  (c)  effects  of  race  oi:  proctor/experiaenter 
on  test  performance,  id)  prograss  for  disadvantaged  and  ghetto  youth,  and  (2)  cogni¬ 
tive  style  differences.  The  literature  review  provided  in  the  prior  report  included, 
but  was  not  limited  to:  (a)  legal  aspects  of  employment  discrimination ,  <b)  problems 
in  establishing  transethnic  group  test  fairness,  (c)  studies  into  differential  validity, 
(d)  qualitative  differences  in  intellectual  functioning  and  performance,  (e)  perform¬ 
ance  comparative  studies,  and  (r>  motivational  considerations. 


Test  and  Measureraent  Error  for  Various  Socioeconomic  and  Racial  Groups 

< 

Garcia  (t 372) -posited  that  intelligence  tests  should  only  be  used  as  a  measure 
of  scholastic  progress.  In  cae  study  supporting  this  notion,  Israeli  Kibbutz,  children 
demonstrated  that  they  could  learn  to  do  better  on  tests  of  intelligence.  Garcia  in¬ 
dicated  that  the  idea  of  a  single,  general  intelligence  existing  independently  of  envi¬ 
ronment  is  doubtful  and  possesses  nc  social  utility.  According  to  Garcia,  designers 
of  intelligence  tests  incorporate  assumptions  which  make  the  tests  useless  for  compar¬ 
ing  different  ethnic  and  racial  vtc «.ps.  Garcia  concluded  that  intelligence  tests  are 
part  of  social  conspiracy  tc  promote  the  status  quo. 

Gael  and  Grant  (1972)  investigated  the  relevance  of  employment  te-,ts  to  actual 
job  performance  in  an  effort  to  generate  culture  fair  tests  that  would  predict  potential 
for  a  service  representative  position.  Minority  (n~  107)  and  norrainority  (n=  193) 
subjects,  newly  hired  in  the  position,  were  given  tests  related  to  specially  developed 
proficiency  criteria.  The  results  indicated  that  individual  and  composite  test  and 
criterion  averages  obtained  by  the  two  samples  differed  significant,.,.  ,  but  that  the 
validity  coeff ici  — ts  were  cctnarable.  Regress icn  equation  caparisons  indicated  that 
common  test  standards  could  be  ’used  to  evaluate  minority  and  nonaainority  job  applicants. 

Green  (1372)  examined  racial  int  iligence  differences  fror  the  genetic  and 
environmental  points,  of  view.  Eleven  hundred  and  twenty-seven  Puerto-Rican  subjects 
were  placed  into  five  groups  along  a  skin  color  continuum  (group  1  was  white,  group  i» 
was  black,  and  the  other  groups  were  in  between).  Intelligence  test  results  indicated 
a  similarity  between  groups  1,2,  and  3.  Groups  4  and  5  scored  lower  than  these  groups. 
These  results  were  said  to  parallel  the  social  situation  in  Puerto  Rico,  where-  obvious 
blacks  arf  victims  of  racial  prejudice,  while  non-whites  of  lighter  skin  are  accepted 
as  white.  Green  compared  the  Puerto  Rican  situation  tc  that  of _ the  United  "tates, 
where  the  prejudice  line  is  drawn  between  whites  and  non- whites  (between  groups  1  and 
2).  The  conclusion  drawn  was  that  intelligence  test  scores  follow  the  prej .dice  line 
move  closely  than  the  genetic  line. 

Mercer  (1972)  explored  three  probleres:  (I)  what  is  cental  retardation  and  how 
does  the  confusion  of  criteria  affect  the  labeling  of  minority  groups?  {2)  does  the 
so  called  regarded  2.0.  affect  m  person’s  ability  to  get  along  with  ethers  and  to 
cope  with  the  environment?  (3)  dees  the  different  cultural  backgrounds  of  blacks  and 


* 


I 


Chicanos  have  anything  to  do  with  lower  I.Q,  scores?  Tests  and  questionnaires  were 
administered  to  retarded  exults  and  children  ifi  a  medic®  size  city.  It  was  concluded 
that:  (1)  persons  of  lower  incase  groups  are  bo re  likely  to  he  labeled  as  retarded , 

(2)  raw  I.Q.  scores  are  invalid  without  soa*  test  of  environmental  adapt  at  ion  {many 
minority  people  classified  as  retarded  are  well  adapted  to  their  environment),  and 

(3)  I.Q.  tests  are  really  measures  of  the  degree  of  indoctrination  irto  the  angio 
culture. 


The  SPSS I  <1558 >  statement  oa  race  and  intelligence  reafflrrs  the  idea  that 
racial  differences  in  inteiliget .’c  are  sot  innate.  Research  has  indicated  that  the 
acre  similar  the  background  across  write  and  black  groups,  th»-  more  similar  the  in¬ 
telligence  test  scores.  Because  of  various  farms  of  subtle  discrimination,  blacks 
can  n'-t  lea.  the  same  type  of  lif  %  as  whites  who  possess  cosparable  class  .backgrounds . 
Only  when  equality  has  existed  for  several  generations  car  we  begin  to  compare  racial 
differences  is  intelligence  test  performance. 

In  anrlner-  study  (Sill lass s,  1*72),  results  of  the  Iowa  Tests  of  Basic  Skills  were 
correlated  with  race  and  parental  income  level  for  public  school  children-  It  was  deter¬ 
mined  that  while  race  could  account  for  scree  of  the  ach  juvesent  decrement*  ettrese 
poverty  < r-egardlsss  of  race)  was  a  sore  significant  variable  in  achievement  decrement . 

In  a  similar-  study  Southern  arc  Plant  (j.972>  found  that  young,  preschool  and  kinder¬ 
garten  children  of  poverty  stricken  failles  dcssonstrated  deficient  general  intellec¬ 
tual  and  language  afcilities. 

Sreen  and  Rohwer  (1971)  examined  the  relationship  between  socioeconomic  status 
(S£S)  and  several  academic  and  learning  measures -  They  found  that  SES  was  related  to 
a  digit  span  test  score,  and  *o  the  Raven  Progressive  Satr ices  test  score.  SES  was 
not  related  be  paired  associates  learning-  SES  was  also  found  to  be  related  to  long 
teres  school  learning  measures,  an  I.Q.  measure ,  a  complex  profciea  solving  measure, 
and  an  immediate  ©ssory  task.  *ks  relationship  was  found  between  SES  and  a  neasure 
of  short  tens  learning. 
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Vineberg,  Taylor,  and  Cay lor  (1970)  and  V iceberg  and  Taylor  ( 197 2a,  1 9? 2b) 
compared  the  job  perfomartc  of  men  in  five  different  specialties  at  several  AFQT 
levels.  They  found  that  performance  test  scores  were  related  to  both  AIQ?  scores  and 
job  experience.  Job  experience,  though,  accounted  for  considerably  sore  of  the  per¬ 
formance  test  ra  _^nce  than  7 Id  AFQT  scores-  Also,  there  was  considerable  overlap  in 
oerfoz .'lance  tes’  scores  at  different  AFOT  levels.  These  writers  suggested  that  a  large 
number  of  potentially  good  performers  are  lost  to  the  Army  because  of  low  AFOT 
scores.  In  addition,  the  job  performance  test  scares  of  whites  and  blacks  were  not 
significantly  different  even  though  blacks  had  lower  average  AFQT  scores. 

Such  (1972)  evaluated  2G  tests  in  a  business/industriau  setting  to  determine 
whether-  or  hot  the  tests  possess  differential  validity  for  blacks  and  whites. 
Separate  statistics  were  computed  for  both  blacks  and  whites.  Analysis  involved 
significance  tests  of  homoge-eity  of  regression  lines,  standard  error,  and  intercepts. 
The  results  indicated  no  evidence  of  differential  validity  for  regression  line  and 
standard  error  parameters.  Significance  date  ft  os  the  intercept  parameter  indicated 


that  black  performance  was  cverest  lasted,  and  that  tL  tests  vssx  unfair  tc  -wuit«s. 

Ruch  concluded  that  following  OFCC  and  EEOC  guidelines  involving  eossson  art/ if  a  '.ores 
and  prediction  procedures  reduces  eaployffient  opportunities  for  blacks. 

Hunt  (1971)  reports  that  sost  studies  find  that  the  proportion  c  "  £a<c%u : 
test  score  variance  that  is  attributed  to  heredity  is  roughly  SO  par  cent.  Thus. :  stud¬ 
ies  have  led  to  the  belief  that  racial  and  ethnic  differences  in  intelligence  au'e  fixrc 
at  birth.  This  notion  is  contrary  to  evidence  derived  from  the  plasticity  of.  beifivisr 
research.  In  this  research,  behavior-  sesss  to  be  highly  dependent  upon  aanipuiat  ions- 
of  the  envirosaent . .  The  plasticity  studies  indicate  that  early  eavirm&e&t  accounts 
for  over  50  per  cent  of  the  behavioral  variance.  Bunt  feels  that  heritSuOility  only 
tells  the  variance  of  genotypic  intelligence  for  children  reared  in  a  specific  static 
environment .  It  doesn't  indicate  the  changes  in  intelligence  that  can  take  place  assoag 
children  reared  in  different  environments  with  varying  educational  perograss .  When 
many  of  the  socioeconcaic  and  ervirasHaentai  differences  are  controlled  across  race  and 
class  membership,  achievement  and  intelligence  test  score  differences  tend  to  dis¬ 
appear. 

Humphreys  (1972)  discusses  the  various  aspects  of  test  fairness  with  respect 
to  staking  judgments  about  individuals  fresa  differing  groups.  First,  test  decisions 
should  be  sade  only  with  ©axisaliy  valid  tests.  Decisions  should  never  be  based  on 
short  tests.  Second,  Kisapfereys  Indicates  that  we  can  never  achieve  tests  with  perfect 
fairness  due  tc  regression  problems.  We  can,  though,  fit  the  regression  lies  to  the- 
data  in  cases  in  which  the  deviations  frea  the  regression  line  are  saall.  Third, 
there  ia  little  difference  in  slopes  aid  intercepts  across  various  denographic  groups 
such- as  i:tdicated  by  ®ost  data.  Use  of  a  single  regression  line  therefore  causes  no 
appreciable  unfairness  in  drawing  inferences  about  performance  criteria.  The  amount 
of  error,  in  fact,  is  generally  less  than  the  ssspling  errors  of  the  regression  coef¬ 
ficients.  Finally,  Hussprxeys  contended  that  the  same  psychological  principles  apply  to 
both  whites  and  blacks  since  the  differences  across  both  groups  have  not  been  too 
large.  If  not ,  Macks  and  whites  mist  be  considered  to  represent  different  species. 

Humphreys  indicated  that  all  of  the  afor-esenticmed  ccassents  apply  to  achievement 
tests.  Measures  of  intelligence,  though,  which  depend  upon  the  opportunity  to  learn, 
are  highly  unfair.  Accordingly,  tests  are  better  at  predicting  behavior  than  at  mak¬ 
ing  inferences  about  theoretical  constructs  like  learning  ability. 
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Personality,  Achievement,  and  Self-Concept  Factors  Affecting  Test 
Scores 


&rtz  (19ftS),  in  a  critique  of  personality  deficit,  or  cultural  deprivation 
theories  of  underachieveasnt ,  suggested  that  two  factors  account  for  cost  of  the 
black  achievement  variance:  (1)  sense  cf  environmental  control,"  and  <2)  "school 
anxiety."  Both  have  their-  source  in  e. .rly  familial  experiences,*  and  both  seer, 
to  be  modifiable  by  other  (later)  school  experiences.  According  to  Katz, -other 
factors ,  such  as  the  '’father  absent"  ana  "n-aehfsvesneist"  hypothesis,  tend  to  be 
unrelated  to  black  school  ach I ev  event .  Kata.  also  criticized  the  "cultural  conflict" 
hypothesis,  or  the  idea  that  competence  in  the  black  culture  is  irrelevant  to  the 
culture  of  the  school.  Research  has  shown,  t sough,  that  the  "culture  conflict” 
hypothesis  reduces  to  expectancy  of  attaining  goals  in  a  given  culture,  not  in  the 
choice  of  tbe  culture  to  which  one  strives  to  achieve. 

The  three  studies  that  are  discussed  in  the  following  sections  produced, 
essentially,  incompatible  results.  All  of  these  studies  attempted  to  relate  self- 
concept  "o  either  race  or  SES-  One  of  the  studies  achieved  positive  results  (Lefebre, 
1971),  one  achieved  inverse  results  (Scares  S  Soares,  1971),  and  one  achieved  negative 
results.  Perhaps  if  investigators  ir.  this  area  would:  (a)  agree  on  a  definition  of 
seif -concent-  (b)  use  a  coEsaon  measure  of  self-concept,  or  (c)  use  a  common  research 
strategy ,  equivocal  results  could  be  avoided. 

befebr..  (1971)  administered  the  Tennessee  Self-Concept  Scale  to  4G  white  and 
to  43  black  junior  high  scb-ucl  students  who  were  matched  for  age,  I.Q. ,  and  SES. 

Blacks  tended  to  score  significantly  lower  on  the  following  scales:  (a)  total 
positive  self  esteem  (p<-01),  (b)  behavioral  self -concept  (p<-01),  (c)  personality 
integration  (p<.OlK  (d)  ethical  self-concept  (p<.05),  (a)  identity  (p<t.05),  {£)  self- 
satisfaction  (p<-0S),  and  (g)  general  maladjustment  of  self-concept  (p<.01). 

Soares  and  Soares,  (1971 )  fotsid  that  disadvantaged  children  view  themselves  and 
think  that  rers  view  themselves  sore  positively  than  do  advantaged  children. 
Secondary  school  students  (both  advantaged  and  disadvantaged)  exhibited  a  diminished 
self-concept  when  concaved  with  elementary  school  students .  Soares  and.  Soares 
suggested  the  possibility  that  this  phenomenon  is  due  to  pressures  of  higher  level 
education. 

Get singer ,  Kunce,  Hiller,  and  Vcinberg  (1972)  attempted  to  relate  three  measures 
of  self-estee®  to  SES,  welfare  status,  race,  educational  advancement,  and  sex.  A 
group  of  133  sixth  grade  urban  children  was  used  as  subjects.  The  results  indicated 
that:  (1)  none  of  the  measured  personal -social  characteristics  were  correia  -.d  with 
.  self-estee®  as  sea:: ured  by  the  Coopcrssiith  Inventory  scores,  (2)  all  seasur-  uut 
educational  advancement  (r-  .17,  p<.05)  were  unrelated  to  the  Soares  and  Soares  in¬ 
ventory  ,  and  (3)  significant  correlations  were  found  between  Zilier  inventory  scores 
and  SES  (r=  .13),  race  (r=  .18)j,  and  education  (r=  .17).  Welfare  status,  SES.  race, 
and  education  v,xre  all  significantly  interrelated ,  but  sex  was  relates  only  to  educa¬ 
tion.  These  writers  concluded  that  race  and  SES  are  : neons equen t ly  related  fc  self- 
concept  in  the  given  age  group. 


*Tbe  effects  of  early  experience  on  the  academic  achievement  of  blacks  is  Bore 
cosspletely  discussed  in  Siegel  and  Bergsan  (1972). 
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Abramson  {19?i )  matched  22  ninth  grvde  black  subjects  frors  an  integrated 
school  with  22  black  subjects  from  a  segregated,  1  on  academic  achievement. 

Level  of  aspiration  was  measured  by  a  d Igit-lec rer-aubsticutian  test,  h  IJans- 
Whitne:*  U  test  indicated  hat  toe  integrated  3tude*zts  had  significantly  higher 
aspiration  levels  <p<.05j!  than  the  segregated  students.  A  nonstatist ; ca 1 ly  signi¬ 
ficant  cerrelat icn  of  .18  was  found  between  academic  achievement  and  aspiration  levels 
for  the  integrated  gruup,  while  a  statistically  significant  correiatioa  cf  .52 
(p<.CJ  j  was  found  between  these  sa&e  two  variables  for  the  segregated  group, 
dbraasos  suggested  that  the  integrates  students  are  unrealistic  goal  setters,  or 
perhaps  their-  high  aspiration  is  indicative  of  future  academic  achievement . 

Veto ff  and  Feele  (1983}  observed  that  black  sales  who  transferred  from  an 
all  black  school  to  a  white  school  increased  their  achievement  sotivstioc  (p<.&5), 
while  black  sales  who  remained  in  the  all  black  school  scored  the  sane  on  achievement 
motivation.  There  were  no  differences  with  regard  to  black  fes*.  across  both  types 
of  schools. 

Race  of  Proc  tor /Exper  isen  ter  or.  Test  Perfomaace 

Yando,  Z-igler,  and  Gates  (1371 )  administered  tests  designed  to  assess  social 
approach  and  avoidance  >  curiosity,  arid  intelligence  to  72  black  and  to  72  whits 
"lower  class"  children.  The  examiners  were:  three  "effective”  white  and  three 
"effective"  black  teachers,  and  three  "ineffective"  white  ani  three  "ineffective" 
black  teachers.  The  results  of  this  experiment  indicated  that  the  perforcance  of 
the  children  was  influenced  by  individual  variations  in  the  persocal  characteristics 
of  the  adults7  race-  for  example,  achievement  <&  intelligence  tests  fts  a  function 
of  the  effectiveness  cf  the  teacher,  and  was  unrelated  to  race.  These  Influences 
were  relatively  constant  for-  both  black  aad  white  children. - 

Puvky  and  Santa  (19721  exasinecs  the  differences  in  the  way  In  which  liack  and 
whits  experimenters  interact  with  black  and  white  subjects,  and  the  effect  of  the  in¬ 
teraction  on  test  schievesesit.  Thirty-six  Kegro  and  3§  white  preschool  children  were 
the  subjects.  Both  white  and  black  subjects  obtained  higher  scores  with  white  experi¬ 
menters  than  with  Kegro  expev-isenCers,  White  experimenters  were  rated  as  providing  a 
core  positive  social  atasosphere  than  «egro  experimenters  fer  both  groups  of  subjects. 


Gould  and  Klein  investigated  the  effects  of  black  and  white  testers  on  the 
perfortsance  of  black  (r.=  *+6)  and  white  (r<=  38)  students  in  a  racially  aired  testing 
situation.  These  researchers  found  that  black  subjects  perforaad  as  well  with  white 
test  adain istretors  as  they  did  with  black  test  administrators  or*  both  timed  and  un¬ 
tie;  ed  intellectual  tests.  On  social,  art itmiinal .  and  personality  measures,  though, 
blacks  presented  themselves  more  positively  when  tested  by  a  white  than  when  tested 
by  a  black  test  adsinistratcr. 
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Watson  (1972)  studied  the  effects  of  several  stressful  interpersonal  variables 
on  the  I.Q-  scores  of  black  children.  The  sanipuiated  variables  ir-  this  study 
(1)  face-to-face  relations  with  the  test  sdainistrator,  {2)  knowledge  of  the  test's 
purpose,  and  (3)  knowledge  that,  their  scores  would  be  compared  with  the  scores  of 
white  children.  Uatson  concluded  that:  O)  stress  itself  can  ic:%r  I.Q.  scores, 

{21  repressed  agression  in  the  presence  of  whites  can  lower  I.Q.  scores,  aad  <3) 
white  test  administrators  ere  part Iculsr ly  threatening  to  younger  black  children. 


Programs  for  Disadvantaged  and  Ghetto  Youth 

Tobias,  Busby,  Greenfield,  Goldberg,  and  Hcml  (1%8)  described  the  techniques 
used  by  several  antipoverty  agencies  to  prepare  tbs  hardcore  unemployed  for  Industry. 
Mobilization  for  Touch,  Inc.  views  such  factors  as  arriving  to  work  on  tise  as  a 
job  skill  requiring  training  rather  than  as  a  trebles*  for  the  employer,  {fee  sethod 
to  condition  a  person  to  arrive  at  work  or.  rice  is  to  allow  a  time  range  of 
arrival  from  5:06  a  to  9:30  s.E. ,  but  to  allow  the  early  arrivals  to  use  the  new 
ard  better  equipnent  and  the  late  arrivals  to  use  old  equipment.  Eventually  9:30 
a-m.  would  be  the  expected  arrival  tise.  Mobilisation  fer  youth  sees  two  problsas 
ir.  transferring  free  training  to  the  job:  H  )  verbal  diff  icuiti.es,  and  (2)  treass- 
ferral  of  work  dOr.-e  by  TCbiiitation  for  youth  to  the  real  job  situation. 

Youth  ir.  Action  attempts  ;r  resociailse  disadvantaged  youth  by  training  the® 
for  jobs,  teaching  than  not  to  hare  themselves,  a  d  because  of  this  hate  engage  in 
destructive  behavior.  Ir.  this  progras,  is  order  that  trait-  perspective  will  be  in¬ 
creased,  youths  are  instructed  to  unknown  aspects  of  the  culture  such  as  theatre 
and  auseisas . 

The  Hew  York  City  Board  of  Education  Job  Counseling  Centers  posit  that  per- 
ccoaliaation  of  ail  irrcerac ; ions  with  the  disadvataged  is  a  rust,  these 

individuals  feel  th?t  they-  ore  "ron-persor.s . ”  At  the  job  counseling  centers:  (*} 
the  clinical  ataosp&ere  is  lessened,  (h)’  waiting  is  kept  to  a  oinitauss,  (c)  doors 
are  kapt  open,  Id/  dr-ess  is  inforsal,  (e)  there  is  a  group  coffee  hour  every  evening, 
and  (f)  counseling  is  available  when  needed. 

Skin  Advancement ,  loc.  deals  with  the  underespioyed  who  are  circcdy  eap-oved 
at  the  lowest  level  ir.  industry.  They  train  management  as  wall  as  the  sr.ployees 
zs  order  to  create  a  ciiaat®-  is  which  tire  verxer us-s  advance.  These  employees  are 
given,  skill  training,  htsssn  relations  training,  cossuni cation  training,  and  legal 
aid  counseling. 

The  Kil&er-  Job  Corps  Center  trains,  clothes,  homines,  and  feeds  1~&0  stud  eats 
at  a  cost  of  $5,200.03  per  student .  On*.  feature  of  their  program  is  a  daily  group; 
dynamics  session  in  which  boys  get  together  usd  talk  about  their  profcles-s.  or 
matters  occurring  during  the  day  which  they  did  not  understand. 
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Clark  (1968)  indicated  that  industry  sust  understand  the  following  about  ghetto 
youth:  (l)  he  learns  to  survive  early  and  on  his  oa*u  msd  slat  whites  consider 
legitii^ate  is  not.  taken  seriously  by  the  disadvante  jed  youth,  (2)  he  knows  that  he 
is  a  cast  off  free  society  and  he  does  not  expect  anything  to  change,  (3)  he  does  not 
trust  society’s  promises  e’»en  when  aade  by  blacks,  (h)  he  will  try  to  take  advantage 
of  whatever  gimiefcs  he  learns,  (5)  he  has  insight  into  the  iamorality  of  society, 
(6)  be  is  oversensitive  to  criticism,  "because  he's  not  sure  that  be  is  not  actually 
inferior,"  (7)  be  will  always  test  other's  acceptance  of  hi®  through  hostility, 
withdrawal,  and  outlandish  consent  testing.  Clark's  re -cessend&t ions  to  industry  were: 
{a)  offers  made  by  industry  to  the  disadvantaged  must  be  serious,  genuine,  deliverable 
(2)  the  disadvantaged  youth  aust  be  trained  in  his  real  deficiencies:  reading  and  eath 
or-  fee  will  see  industry  as  insincere,  thus  coafiraing  his  expectations ,{ 3}  structural 
standards  of  performance,  The  same  for  white  and  black,  are  needed, (9)  realistic 
rewards  for  achieving  standards  should  be  given  frequently^  S)  the  black  youth  Bust 
have  the  sane  opportunity  for  upward  mobility  as  whites, (6)  the  black  youth  wants 
to  be  accepted  as  aa  individual  as  are  whites, and  (7)  the  black  want  the  same  frv-edoe 
of  choice  as  whites. 

Chappell,  Brenner,  Caamond,  Harrison,  Lee,  Stone,  and  Reade  (1968)  discussed 
tie  pregraas  of  several  cersparien  to  utilize  the  hard  cere  unemployed .  The  Xerox 
Ccrpcrat ion's  project  "stsp  up"  involves  a  19  week  training  program  for  disadvantaged 
ghetto  residents.  The  tine  is  equally  divided  between  training  and  work.  Participants 
are  placed  in  regular  jobs  as  soon  as  they  pass  the  employment  tests.  Eighty-nine 
per  cent  of  the  first  lt>  enrolled  in  the  program  were  able  to  complete  ‘it. 

u>ckheed’s  pregraa  (Chappell,  et  ai,  1968)  for  the  disadvantage!  is  based 
on  tns  following  tenets:  (J  )  train  for  specific  jobs, (2)  develop  self  confidence  by 
breaking  jobs  down  ar*d  rewarding  the  learning  of  each  step, (3)  teacb  using  demonstra¬ 
tion,^)  require  good  work  habits, (5)  provide  counseling , ( 6 )  recognize  individual 
differences,^ )  train  to  high  standards, (S)  household  heads  make  the  best  trainees, 
and  guarantee  a  job  and  pay  the  trainee  during  training. 

Xu  the  3?ndia  Corporation  (Chappell  et  al,  1963)  sensitivity  training  end 
coasan i cat  ions  training  was  stressed  with  a  staple  of  23  Jfexican-Aaericaa  women . 

After  ispiesentatica  of  these  techniques  job  satisfaction  and  performance  were  found 
to  improve - 

Kestinghoase's  (Chappell  et  al,  1958)  recansendat ions  tor  the  training  of  the  dis 
advantaged  in  industry  are  that  there  should  be:  (1)  honesty  in  the  progras»,(2) 
rewards  during  training, (3)  individualised  and  relevant  training, (4)  dynamic  training, 
(5)  integration  of  counseling  and  training ,( 6)  learning  of  the  factory  culture, {7 1 
avoidance  of  pateraalicE:  and  (8)  training  of  plant  and  peer  personnel  to  accept 
and  work  with  the  disadvantaged. 


Cognitive  Style 


Flaugher  end  Rock  (1972/  investigated  the  differing  patterns  of  abilities 
among  hign  school  sales  of  black*  whita,  Bex ican-A£ss”icaa ,  and  oriental  ethnic  groups 
or.  a  nultitest  aptitude  battery*  The  results  indicated  a  significant  similarity 
among  the  factor  loading  patterns  across  the  different  ethnic  and  racial  groups. 

F laugher  and  Rock  concluded  that  essentially  the  same  patterns  of  ability  exist 
regardless  of  +he  ethnic  identity  of  the  examinee. 


Fifth  and  sixth-grade  students  (n=  356)  free  a  suburban  school  s’ere  tested 
on  the  Rar. er<  Progr^ 'sive  Matrices  to  examine  the  internal  consistency  and  culturally 
related  error  r.s  of  the  test.  Differential  £.roup  error  patterns  were  found. 

This  finding  svggests  .^at  the  lower  performance  levels  of  lower  class  subjects  in 
general  and  the  ■»  .orer  performance  of  lower  class  blacks  say  have  been  due  to 

perceptual  rigiai  er  than  a  lack  of  reasoning  ability  (Bartlett.  Wevhrough*  6 

TiXJcla,  1972). 

Wilcox  (1971 j  rou.id  that  60  ';s advantaged  black  college  students  exhibited  a 
qualitatively  different  associative  style  fro®  42  white  college  students.  The  associa¬ 
tions  of  the  black  subjects  shoved  restricted  word  association  hierarchies,  a  lack  of 
unique  associations,  and  a  prevalence  of  opposite  aad  syntagmas tic  associations  in 
response  tc  target  words.  Wilcox  concluded  tost  the  word  associations  of  blacks  show 
c  "deveuopEentai  lag  or  deficiency"  when  compared  with  the  word  associations  of  whites, 
ire  suggested  training  in  assvtiat *’  e  skills  as  one  means  of  eliminating  the  differences 
found . 

Hallahan  (1979)  discussed  tesearch  evidence  demonstrating  that  black  disadvan¬ 
taged  children  respond  impulsively  rather  than  attentively  tc  environmental  stimuli. 

Me  indicated  blacks  to  posse  is  a  shorter  attendee  span  and  tc  react  with  hy  er- 
activity.  This  according  to  Kaiiahan,  is  due  to  restrictive  child  rearing  practices 
and  a  disordered  environtsent  which  prevents  the  subjects  fr-os  learning  to  res,  end 
selectively  arc  appropriately  tc  stimuli-  Thus,  the  behavior  of  the  disadvantaged 
child  was  compared  .tc  that  of  the  brain  damaged  child.  Haliahan  rec'-ssended  training 
the  disadvantaged  child  ir,  attention  skills  arid  structuring  the  environment  to  increase 
concentration  and  decrease' sistt iction  i ro©  extraneous  stimuli. 


Tee  research  diseased  Sn  toe  c-resent  chapter-  supports  the  conclusion  that  a 
significant  proportion  tf  the  variance  of  test  score  differences  for  different  group 
can  be  accounted  for  by  environmental  and  training  differences .  The  results  re¬ 
peated  herein  do  not  warrant  the  conclusion  that  race  of  the  experimenter  has  an 
effect  upon  studec--  test  performance,  fesvever-  the  personal  qualities  of  the  experi- 
center /proctor  appear  .o  have  scsse  effect  on  test  scores.  The  importance  of  train¬ 
ing  deprived  youths  in  basic  skills  sod  work  habits  e.cA  of  orienting  industry  to 
accept  the  •uiderdeveloped  «ere  stressed  in  the  section  on  or-ograss  for  training 
underdev elc ped  youth.  Finally,  cognitive  style  has  been  criticized  fey  scse  as  an 
explanatory  concept  which  accounts  for  the  test  sc ore  differences  soaet Loe3  yielded 
by  different  groups.  The  use  of  differential  training  strategies  is  suggested  as  a 
naans  for  eliminating  test  score  differences  assong  t:  ese  different  groups. 


CHAPTER H 


METHODS 


The  miniature  apt it odd  test  concept  forced  the  basi t  for  the  present  research 
effort.  The  s-iniature  aptitude  <.onc-;pt  is  embedded  within  a  siniature  training  and 
evaluation  situation.  To  develop  a  battery  of  siaiature  a-ptitude  tests  fo"  *he  machinist 
rate  rating,  the  tasks  typically  performed  within  the  rating  were  likens ionalized  using 
job  analytic  techni'  jes.  Sh  >rt  training  sessions  (15-3C  sinutes)  were  then  bunt  around 
the  identified  lob  dimensions.  These  training  or  learning  sessions  enphar I ted  psrrora- 
ance ,  observation,  and  rrac.ice  rather  than  reading  and  writing.  Once  training  vs*.s 
completed,  the  evaluation  phase  cossseneec.  The  evaluation,  of  course,  consisted  of  the 
miniature  aptitude  tests  which  measured  the  amount  learned  in  the  training  or  learning 
j-hise.  Each  ©ini»*ture  test  consumed  no  store  than  5-20  sinutes  of  adtsi nistrat ion  tise* 
Ninety-nine  low  a jtitude  Navy  recruits  participated  in  the  training  evaluations.  The 
rationale  for  the  procedure  is  that  if  the  recruit  can  learn  to  per for®  cne  variety  of 
job  related  tasks  presented  it  the  alniature  training  situations,  he  can  also  learn  to 
oerfons  similar  tasks  through  on-the-job  training  in  the  Fleet .  The  details  of  these 
steps  were  described  in  the  trio*'  report  (Siegel  5  Bergman,  1372 )  of  this  series.  For 
purposes  of  continuity,  these  s  etc  are  also  reviewed  in  subsequent  sections  cf  the 
present  chapter. 

The  ninety-nine  recruits  were  assigned  to  Fleet  ships  for  work  in  the  machinist 
sate  rate.  Nine  months  later,  a  set  of  job  sample  performance  tests  was  adair.i  ;tered 
to  de-ermine  the  predictive  power  of  the  sinirture  aptitude  tests.  The  methods.  and 
result'  of  this  firs-  validation  constitute  the  principal  focus  of  the  present  report. 

The  machinist  ©ate  specialty  was  selected  as  the  vehicle  for  a cccfto  i  t  sru ng  tr.e 
goals  of  the  present  study  fce.-iase  this  ’.'ate  involves  performance  of  tasks  which  are., 
largely  nonverbal  in  nature.  In  addition,  this  rate  is  one  in  which  there  is  net  a 
large  rusher  of  clacks .  Nonetheless,  it  it  a  rate  which  should  be  attractive  to  aoat 
recruits  since  it  is  aoecuateiy  high  on  the  ir, formal  prestige  scale  for  various  Navy 
jobs  and  because  it  offers  the  potential  for  learning  skills  .which  can  ieau  to  post 
Navy  employment. 

The  normal  entry  into  the  machinist  mate  career  field  in  the.  Navy  is  through  the 
Navy  school  for  &iachini.  t  nates.  This  school  involves  training  in  the  fundamental 
skills  and  knowledges  required  for  performance  at  entry  level  in  tne  Fle-t. 
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Subjects 


The  subjects  were  Navy  recruits  who  were  identifies  after  initial  Havy  testing 
at  the  Great  Lakes  Hava!  Training  Center.  As  a  basic  requirement  for  participation, 
a  subject  had  to  have  "failed”*  the  entry  tests  for  the  Machinist  Kate  School.  Kinety- 
s :  r.<  recruits  'ere  so  identified.,  fifty  of  these  recruits  were  white  and  49  were 
block.  Virtually  aul  of  the  subjects  were  between  19  and  20  years  of  age.  Ail  had 
sixpressiv  a  desire  to  enter  the  machinist  taate  rating  during  a  career  counseling 
interview.  The  subjects  were  sot  told  that  they  were  selected  for  a  special  study. 


Miniature  Job  Sample  Test  Development 

The  Machinist  Mate  sectioi  of  the  Havy  Manual  of  Qualification  for  Advancement 
:,n  Satiag  (NAVTESS  13068)  was  consulted  as  a  first  step  in  the  construction  of  the  aini- 
4*ture  training  aod  evaluation  situations.  Thcs£  practical  required  for  advance- 

neat  to  level  E-4  were  extracted.  .Several  of  these  behaviors  were  combined  because  of 
their  similarity.  The  next  step  involved  a  meeting  with  five  Master  Chief  Machinist 
Mates  and  one  Warrant  Officer  at  the  Great  Lakes  Haval  Training  Center.  During  this 
reeling,  a  final  list  of  b eha'-  iors ,  which  ware  held  to  be  idequately  representative  of 
the  sost  frequently  performed  or  critical  tasks  of  the  journeyman  level  machinist  sate, 
were  agreed  on.  The  six  behveioru  identified  were: 

1.  ability  to  identify  and  use  hard  tools  cosmon  tc  the  jon 

2.  ability  to  perform  maintenance  and  to  read  meters  and 
guages  accurately  when  under  sees  degree  of  distract!  r, 
or  when  attention  sharing  Is  involved 

j.  ability  to  make  sisple  repairs  in  pressure  lines 

4.  ability  to  perform  simple  troubleshooting  and  systems 
analysis  in  pressure  systems 

5-  ability  to  operate  squirmed  coraon  to  rate 

6.  ability  to  assemble  and  disassemble  cossson  high  failure 
frequency  items 

These  tasxs  formed  the  basis  for  the  air-iature  training  and  evaluation  situations. 
Several  Master  Chief  Machinist  Mates  then  served  as  technical  consultants  during  the 
actual  lessen  preparation  phase  of  the  project.  This  procedure  resulted  in  the  con¬ 
struction  of  six  tests  and  the  associated  training  situations.  These  training  and 
evaluation  situations  reflected  samples  of  the  six  eost  critical  and/Oi  frequently  per— 
fora*d  iahaviors,  as  listed  above,  or  the  entry  level  ssachinist  sate. 


*h  recruit  mist  exceed  a  combined  General  Classification  Tes,  (GCT),  Arithmetic  (MI), 
and  Mecl^aaical  (?©CH}  score  of  156  to  bo  eligif-’>  for  entry  to  the  machinist  sate 
"AM  school. 
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Each  situation  contained  two  parts.  The  first  part,  a  training  phase,  usually 
involved  a  "show  and  tell"  learning  situation  of  15  to  30  minutes.  Reading  and 
writing  ability  were  not  required  in  any  of  these  lessons  although  sense  of  the  tests 
involved  the  ability  tc  tell  time  and  the  ability  to  read  numbers.  These  are  considered 
to  be  prel iterate  requirements. 


Equipment  Use  and  Nomenclature 

In  the  equipment  use  and  notsenc iature  training  and  evaluation  situation,  th*. 
gcal  was  to  determine  whether  or  not  the  recruit  could  ieam  the  nasaes  and  uses  of 
all  of  the  equipment  and  material  involved  in  breaking-making  and  flange.  The  as¬ 
sumption  was  made  that  if  a  sailor  could  learn  the  cases  and  uses  of  the  tools  and 
materials  involved  in  this  situation,  he  would  also  be  able  to  learn  the  names  and 
uses  of  other  equipment  used  on  the  job.  After  a  tape  recorded  introduction,  the 
instructor  demonstrated  how  to  break  and  make  a  flange.  During  this  demonstration, 
the  use  and  name  of  each  piece  of  equipment  was  discussed  by  the  instructor.  For 
example,  the  instructor  would  hold  up  the  tool  and  say  "This  is  a  When  the 

tool  was  returned  to  the  instructor  he  woiuu  demonstrate  its  use  in  The  flange  repair- 
situation-  When  the  demonstration  was  completed,  a  23  item  true-false  test  was  orally 
administered  to  the  recruits.  For  each  item  in  this  test,  the  instructor  held  up  an 
object  ana  ascribes  a  name  or  use  to  it.  The  I'ecruits  then  indicated  whet he  *  the 
name  or  use  given  by  the  instructor  was  true  or  false  by  encircling  either  the  word 
"true"  or  the  word  "false"  next  to  the  item  number  cn  their  answer  sheets. 


G&sket-  Cutting  and  Meter  Reading 


The  gasket  cutting  and  meter  reading  training  and  evaluation  situation  was  designed 
to  investigate  ability  to  learn-  a  maintenance  task  and  to  perform  when  none  degree 
of  attention  sharing  is  involved.  This  situation  was  also  designed  to  sample  the 
vigilance  situation  in  which  the  machinist  mate,  on  the  job,  must  monitor  the  state 
of  various  equipment  systems  while  he  performs  other  tasks.  After  a  tape  recorded 
introduction,  the  subjects  were  taught,  through  demonstration,  how  tc  sake  a  gasket 
using  a  flange,  a  ball  pean  hammer,  asbestos  gasket  material,  and  some  bolts.  On 
completion  or  the  lesson,  the  subjects  were  given  a  10  minute  gasket  making  practice 
session.  During  the  practice  sessions,  the  instructor  circulated  among  the  recruits 
and  gave  assistance  when  required. 

Rext,  the  subjects  were  taught  how  to  read  a  pressure  meter  and:  (1)  how  to  log 
the  rise  at  which  the  pressure  deviated  from  normal  and,  (2)  how  to  know  whether  the 
pressure  should  be  adjusted  up  or  down  (relative  tc  a  given  nominal  valve).  A  large 
clock  with  a  sweep  hand  was  placed  in  front  of  tha  rocs.  The  recruits  were  able  to 
road  the  tise  from  this  clock.  The  tests  for  both  gasket  cutting  and  meter  reading 
were  administered  together.  That  is,  for  a  ten  minute  period,  the  subject  had  to 
observe  and  record  from  a  meter  while  he  constructed  a  gasket,  meter  which  each 

individual  read  was  placed  at  his  individual  work  static®.  Each  meter  was  individually 


driven  so  'chat  there  was  --  possibility  'or  a  subject,  who  noticed  an  abnormal  condi¬ 
tion  at  his  station,  to  cue  a  recruit  at  another  station  of  an  abnormal  condition  at 
the  seconi  station.  The  signal  presentations  to  all  stations  were  equated  for  number, 
direction,  and  magnitude  cf  deviation.  Completion  of  these  two  tasks  simultaneously 
introduced  the  attention  sharing  cesoonent  into  the  measurement  situation- 

The  gasket  making  was  scored  through  a  checklist  which  was  completed  bv  obervers 
as  the  recruits  oerfor-aed  the  task.  The  scoring  checklist  included  items  on  adherence 
to  correct  methods,  care  and  use  of  tools,  adherence  to  safety  precautions, and  adequacy 
of  the  final  gasket;  The  meter  reading  test  was  scored  the  basis  of:  (1)  number  of 
abnormal  conditions  correctly  noted,  (2)  correct  indication  of  the  direction  of  the 
necessary  pressure  adjustment  (up  or  down)  required  to  restore  the  system  to  normal, 
and  (3)  precision  of  the  log  entry  for  time  of  deviation  fr era  normal. 

Trouble  Shooting 

The  objective  of  the  trouble  shooting  miniature  training  and  evaluation  situation 
was  to  test  the  recrui  t  *  s  ability  to  learn  to  perform  sample  systems  analysis  and 
trouble  shooting  on  a  hydraulic  system.  '  simulated  pressure  system  was  used  as  the 
apparatus  for  both  the  training  and  testing  aspects.  After  a  taped  introduction,  the 
recruits  were  taught  bow  the  pressure  system  operates.  The  apparatus  consisted  or  a 
series  of  color  coded  interchangeable  gears  which  were  so  interconnected  that  a  sisu- 
xatea  pumping  system  was  driven .  k  set  of  valves  controlled  the  flow  within  the  simu¬ 
lated  pumping  system.  Accordingly,  tc  diagnose  a  malfunction  in  the  system,  the  recruit 
was  required  to  comprehend  such  olemercary  relationships  as:  (1)  the  effects  of  gear 
size  on  pump  speed /rate  of  flow,  (2)  -he  effects  of  direction  of  gear  rotation  on 
flow,  (3)  how  differential  gearing  can  produce  changes  in  output  rate,  and  {**)  the 
effects  of  valve  aiia  pump  function  on  system  operation. 

A  series  cf  light  indicators  was  used  to  signal  adequacy  of  flow  at  various 
parts  of  the  simulated  system.  The  recruits'  task  was  to  observe  the  light  indicators, 
to  determine  whether  any  malfunction  existed,  and  to  note  the  cause  of  the  condition. 

In  the  training  situation,  various  malfunction  situations  were  presented,  a,jd  the 
recruits  wer*  taught  what  parts  of  the  system  needed  adjustment  in  order  to  correct 
the  problem  and  the  logic- for  the  correction.  After  training,  the  subjects  were  pre¬ 
sented  wich  a  series  of  practice  trouble  shooting  situations.  For  each  problem,  both 
the  co  rect  answer  and  the  reason  for  it  being  correct  were  discussed.  - 

Twelve  problems  were  orally  presented  to  the  stud  snts  in  the  test  situation. 

Each  problem  presented  a  given  malfunction  indication-  The  subjects  were  required 
to  identify,  by  encircling  a  number,  the  system  component  which  would  cause  the  given 
aalfu action  indication. 


Equipnseat  Operation 


In  the  equipment  operation  miniature  training  situation,  the  recriit  wa;  ight 
to  start  up  and  shut  down  a  actor  and  puap  apparatus.  ,  The  students  were  requ avd  to 
learn  a  33  step  procedure,  including  several  safety  precautions-  Each  sub* -  ct  was 
then  given  the  opportunity  to  practice  starting  up  and  shutting  down  the  apt a^atus. 

After  practice,  a  checklist  type  perforsance  test  was  administered.  Scoring  was 
acccrpiished  during  the  subject’s  performance  and  was  based  on  adherence  to  correct 
procedures  and  observance  of  safety  precautions. 

Assembly 

In  the  assembly  training  and  evaluation  situation,  the  recruits  were  taught  and 
tes.ed  on  the  assembly  of  a  gate  valve  from  its  component  parts.  First,  a  c'estonstra- 
tion  of  the  correct  assemcly  procedure  was  presented.  This  demons t rat ion  was  followed 
by  a  practice  session  in  which  the  students  were  allowed  to  assemble  the  vaU-r  thes- 
selves.  The  instructors  observed  the  students  during  this  practice  period  and  nelped 
the®,  as  required.  After  the  practice  session,  each  recruit  was  individually  tested 
on  his  ability  to  assemble  the  valve.  Again,  scoring  was  through  the  checklist 
procedure. 

Pass  Fail 

A  subject  was  considered  to  have  "passed”  the  miniature  training  and  evaluation 
battery  if  he  scored  "average"  or  better  on  the  trouble  shooting  test  and  "average” 
or  better  on  two  of  the  five  remaining  tests.  These  subjects  were  assigned  to  a 
"predicted  successful”  category  and  randomly  assigned  tc  ships  in  the  Tleat  for  work 
in  the  machinist  mate  rate.  Subjec-s  not  meeting  the  "pass"  criterion  were  assigned 
to  a  "predicted  fail”  category.  These  subjects  were  also  randomly  assigned  to  ships 
in  the  -Fleet  for  work  in  the  machinist  mate  specialty.  The  results  indicated  that  there 
were  no  differences  in  the  prooort ion  of  whites  and  blacks  passing  these  tests. 

The  rationale  behind  the  choice  of  passing  scores  was  that  the  trouble  shooting 
miniature  training  and  evaluation  situation  was  cogaitive  in  nature,  while  the  remain¬ 
ing  situations  involved  the  learning  of  manipulative  procedures.  Hence,  those  in  the 
"  -edict ed  successful"  group  demonstrated  scse  cognitive  as  well  as  some  manipulative/ 

1  rocedural  skill  learning  ability  in  the  miniature  job  learning  situation. 

The  score  tf  each  recruit  in  the  sample  on  the  GCT,  AEX ,  KECH,  and  CLEF,  tests 
of  the  Navy  classification  battery  was  provided  by  recruit  classifies" ion  personnel 
at  the  Great  Lakes  tl aval  Training  Center.  These  data,  presented  in  our  earlier  report 
(Siegel  and  Bergman,  1972),  indicated  fairly  close  agreement  between  the  white  and 
black  groups  on  these  tests. 


Instructors  and  Test  Administrators 

Two  instructors/test  administrators  managed  each  training  and  evaluation  session. 
One  instructor/test  administrator  was  a  retired*  black,  Chief  Petty  Officer*  The 
second  instructor/test  administrator. was  a  white  psychologist  frees  the  Applied,  Psycho¬ 
logical  Services’  staff.  The  assigned  duties  of  th-  black  Chief  Petty  Officer  were: 

1-  taping  the  introduction  to  each  session 

2.  conducting  the  training  segment  of  each  lesson 

3,  administering  the  tests  to  black  recruits 

This  instructor  was  thoroughly  trained  in  the  content  he  was  to  present,  tue  teaching 
methods  he  was  to  employ,  and  the  test  procedures  prior  to  implementation  of  the 
present  program. 

The  duties  of  the  white  psychologist  were: 

1.  organization  and  direction  of  the  mini  job 
sample  learning  and  testing  program 

2-  training  the  black  instructor/test  administrator 

3.  assisting  the  black  instructor /test  administrator 
in  his  lesson  presentations 

4.  testing  the  white  recruits 

A  black  instruct or/test  administrator  was  used  because  his  edueati  ©,  personality, 
verbal  inflection*  and  method  of  treating  the  subject r  were  of  such  a  nature  that  he 
could  easily  be  identified  with  _;id  understood  fey  the  biac’"  recruits. 


3nvtiug 

All  of  the  miniature  training  and  evaluation  sessions  were  conducted  in  a  large 
classroom  provided  by  the  Machinist  Hate  School,  Great  Lakes  Naval  Training  Center. 
This  classroom  contained  12  student  desks  and  sis  work  tables  of  various  sizes.  The 
lighting,  ventilation,  temperature,  space,  and  privacy  were  considered  optimal  for 
this  type  of  study. 

All  sessions  began  at  0720  hours  and  were  completed  by  1500  hours.  Between  5 
to  12  recruits  were  .involved  during  each  training  and  evaluation  day.  The  total 
training,  practice,  and  testing  was  completed  for  any  one  group  within  the  time  indi¬ 
cated. 
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Quest ionnaire 


A  36  item  personal  background  questionnaire  was  constructed  by  Applied  Psycho¬ 
logical  Services  to  Beasure  various  facets  of  cultural  different  iat  too  including  * 

(a)  need  achievement,  (b)  hose  environment ,  (c)  school  environment ,  and  (d)  other 
demographic  variables.  This  quest iosna ire  was  administered  to  all  recruits  in  the 
miniature  training  and  evaluation  sample.  It  was  considered  that  these  cultural 
factors  could  moderate  learning  ability  to  the  extent  that  the  -correlations  between 
the  miniature  evaluation  test  scores  and.  the  ultimate  performance  criteria  would  be 
lowered . 

In  addition.  Applied  Psychological  Services  administered  the  same  quest icons' re 
to  a  control  group  of  machinist  sate  A  school  recruits,  who  had  net  or  surpassed  t  a 
required  GOT,  API,  i-*3  MUCH  test  score  for  the  machinist  mate  rating. 

The  questionnaire  administered  to  the  nlnialure  training  an*  evaluation  sample 
was  factor  analyzed.  Nine  factors,  accounting  for  46  per  cent  of  the  variance,  were 
extracted.  These  factors  were  called :  self-esteea,  environmental  stifcilation,  reading 
habits,  educational  attainment,  educational  initiative,  parental  interest,  monetary 
deprivation,  educational  encouragement,  and  urbanity.  Our  earlier  report  ia  this 
series  {fiegel  5  Bergman,  1972)  presents  the  actual  items  and  factor  loadings  for 
each  factor. 

In  order  to  test  whether  or  not  the  low  aptitude  group  differed  significantly 
on  the  cultural  differentiation  factors  from  the  A  school  group,  factor  score  scans 
for  the  high  aptitude  (A  school)  group  asi  for  the  low  aptitude  group  were 
calculated.  Tests  (*t"  tests)  were  conducted  between  the  group  isean  scores  for  each 
factor.  For  eight  of  the  nine  cultural  factors,  the  low  aptitude  group  differed 
significantly  from  the  high  aptitude  group. 

Finally,  is  order  to  determine  if  the  miniature  training  and  evaluation  tests 
were  less  contaminated  by  cultural  factor®  than  tne  4a vy  qualification  tests,  correlation 

coefficients  were  calculated  between  the  Savy  qua l if  i cation  test  scores,  the  miniature  iofc 
sample  tests,  and  the  questionnaire  factor  scores  for  the  93  recruits  in  the  Rlow 
aptitude^  sanpie.  Only  or.e  of  the  correlations  between  the  miniature  training  and 
evaluation  tests  and  the  cultural  factors  was  statistically  significant.  Alternately, 
seven  wf  the  correlations  b“-tweer.  the  Navy  qualification  tests  and  the  cultural  factor 
scores  were  statistically  significant-  Thewe  data  support  a  contention  that  the  mini 
tests  are  less  culturally  loaded  than  the  usual  Navy  quaiif icaticn  tests. 

Interview 

Qfi  cosplet ion  of  the  miniature  training  and  evaluation  situations, each  recruit 
was  interviewed  by  one  of  the  instructors /'test  sara  Ini senators  ia  order  tc  obtain 
reactions  to  the  entire  training  and  testing  program.  The  subjects  wp-e  asked  to 
compare  the  tests  asd  training  they  received  ir.  the  present  program  with  other  types 
of  tests  ana  training  they  received  in  the  past.  All  subjects  ware  encouraged  to 
respond  freely  aru  openly  tc  the  interviewer.  The  inter  lew  questions  were  constructed 
In  a  manner  permitting  quantitative  and  qualitative  analysis  of  the  interviewee  responses. 


The  results  of  this  interview  analysis  are  susssarized  in  the  following  paragraphs^ 

Seventy-eight  per  cent  of  the  recruits  thought  the  training  portion  of  the 
program  Has  'xetter  than  the  trainicg  they  had  received  in  other  programs.  Twenty 
per  cent  of  t5te  .subjects  thought  the  training  portion  of  the  progress  was  the  sajse 
as  the  trair/ijsg  they  had  received  in  other  training  programs.  Only  one  per  cent 
of  the  subjects  thought  that  the  training  they  received  was  worse  than  that  of 
other  training  program s.  These  results  tend  to  support  the  esphasis  placed  on 
performance  and  sanipulative  skills  during  the  learning  sessions  rather  than  on 
reading  and  writing. 


Eighty-six  per  cent  of  the  subjects  thought  the  miniature  job  learning  tests 
were  "better  than"  pej^er  and  pencil  tests.  Twelve  per  cent  of  the  subjects  thought 
the  progress  tests  were  the  "same  as"  paper  and  pencil  tests.  Again,  only  one  per 
cent  thought  the  program  tests  were  "worse  than"  paper  and  pencil  tests.  These 
results  support  the  use  of  performance  oriented  tests  which  require  littJs  or  no 
reading. 

Finally',  §8  per  cent  of  the  subjects  indicated  that  they  enjoyed  participating 
in  the  training  and  evaluation  progi-aa,  while  only  two  per  cent  indicated  that  they 
did  no;  enjoy  it. 

These  responses  and  the  results  of  the  first  two  questions  permit  the  conclu¬ 
sion  that  the  low  aptitude  Savai  recruits  show  an  overwhelming  preference  for  the 
training  and  evaluation  program,  as  employed  here,  over  the  acre  traditional  test¬ 
ing  approaches. 


_  .  _ -j  -  . . . . .  .  /*■ 

*  A  ©ore  complete  interview  analysis  was  presented  in  the  previous  report  of 
this  series  (Siegel  &  Bergman,  1972). 


CHAPTER  m 


CRITERION  DEVELOPMENT 

Quite  obviously,  freedos  free  cultural  bias,  equivalence  for  both  black  ansi 
white  groups,  and  the  like  represent  necessary  but  not  sufficient  ingredients  for  us 
assessment  approach.  Tee  predictive  validity  of  the  miniature  job  learning  tests  mast 
also  be  demonstrated .  To  this  end,  the  recruits  in  the  Hlow  aptitude*'  sample  were 
followed  up  after  they  had  served  6  to  S  souths  in  the  machinist  ®ate  specialty.  The 
followup  attempted  to  seasure,  through  the  work  sample  performance  test  approach, the 
ability  of  the  sen  to  perform  various  aspects  of  the  job  of  the  machinist  mate.  These 
criterion  data  were  supplemented  with  supervisor  ratings  and  supervisory  interview 
data.  Is  this  report,  we  place  primary  espbasis  on  criterion  referenced  followup 
tests  and  only  rainlsua  eaph&sis  on  the  supervisory  ratings  and  interview  data.  It  is 
known  that  supervisors  often  r=tte  black  job  incumbents  lower  than  whites-  flaugher, 
Campbell,  and  Pike  (1989)  found  that  white  supervisors  rated  a  group  of  Negro  incuabents 
one  half  a  standard  deviation  lower  than  Negro  supervisors.  Negro  supervisors,  though, 
did  not  rate  white  incumbents  higher  or  lower  than  white  supervisors. 

•  Nevertheless,  supervisory  evaluative  data  are  cons:  to  be  of  interest  in  the 

present  context. 


I 
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Criterion  Tests 

To  develop  the  criterion  tests  for  use  in  this  study.  Applied  Psychological 
Services*  oerso snel  first  reviewed  portions  of  Nf.VPERS  1SG88A  relevant  to  the  machinist 
mate  rate.  Several  pro_x>red  ideas  for  criterion  referenced  sin  month  fleet  performance 
tests  were  extracted.  The  next  step  was  to  elaborate  on  these  testing  possibilities 
with  experts  in  the  machinist  s&te  rate.  The  experts  in  this  si teat ion  were  primarily^ 
needed  to  supply  storable  testing  items ,  fruitful  test lug  suggestions,  'and  statements  of 
criteria  for  "acceptable-*  work  on  each  criterion  objective  after  six  months  experience 
as  a  machinist  sate  striker,  four  such  experts  were  made  available  by  the  Maval  Desage 
Control  Training  Center,  Philadelphia. 

These  experts  agreed  that  the  following  practical  performance  itens  would 
constitute  an  adequate  test. of  the  ability  of  a  machinist  sate  striker,  with  six  -months 
fleet  experience,  to  meet  normal  performance  expectations: 

1.  standing  messenger  watch  . 

2.  making  and  breaking  a  flange 

3.  packing  a  valve 

4.  procedures  in  cosaaon  malfunction  and  in 
emergency  situations 

5.  knowledge  or  use  and  name?  of  coseaoo 
equipment  ar>i  tools 

6.  general  alertness  and  coesaon  sense  in  the 
work  situation 


Three  separate  meetings  were  held  to  isolate*  derive,  and  define  the  performance 
objectives  ar-d  the  methods;  for  oeas-orlng  performance  on  these  objectives.  As  a  result, 
sis  nonverbal  performance  tests  were  defined .  Each  of  these  is  described  categorically 
below. 


Messenger  Watch 

The  aesseager  watch  examination  tested  «<aainee’s  ability  to  record  accurately 
data  and  to  determine  malfunctions  indicated  by  the  data.  Pictorial  simulations  of 
throttle  board  situations  were  presented ,  one  at  a  time,  *:o  the  exaainee.  The 
examinee  was  requlr-ed  to  record  accurately  data  fro©  the  caters  and  gauges  depicted 
on  the  throttle  board  simulation.  The  examinee  was  also  required  to  report  to  the 
exasiner  those  throttle  hoard  readings  which  indicate  an  abnormality  or  malfunction  in 
the  systers.  f 

Thus,  csefe  examinee  received  two  .scores.  One  score  was  based  on  his  degree  of 
accuracy  in  recording  data  fro®  the  simulated  throttle  board  pictures;  the  second 
score  indicated  his  ability  to  detect  malfunctions  on' the  basis  of  given  data. 

The  directions  to  the  nessenger  watch  test,  which  were  given  orally  to  each 
exasinee  prior  to  testing,  are: 

"THIS  IS  A  TEST  OF  YOUR  ABILITY  TO  COPY  NUMBERS  FRO*  MET EPS  AMS 
GAUGES  ONTO  A  LOS  SHEET.  THIS  IS  ALSO  A  TEST  OF  YOUR  ABILITY  TO  T Ed,  US 
IF  THERE  IS  SOMETHING  WRONG  WITH  THE  GAUGE  OS  METES  READING.  Y*j  MILL 
SE  PRESENTED  WITH  A  SERIES  OF  PICTURES  SHOWING  COSMOS  METERS  AHS  GAUGES 
FOUND  OK  A  THROTTLE  BOARD.  ME  WAN?  YOU  TO  RECORD  THE  READING  GS  EACH  OF  . 

THESE  METERS  AND  GAUGES  OH  A  LOG  SHEET  WHICH  ME  MILL  GIVE  YOU.  HAKE  YOUR 
READINGS  AS  ACCURATELY  AS  POSSIBLE.  ALSO,  SE  WANT  YOU  TO  TELL  US  WHICH 
METERS  AND  GAUGES  INDICATE  AN  OUT  OF  NORMAL  CONDITION  WELCH  SHOULD  BE 
REPORTED.  THE  PICTURES  YOU  SILL  SEE  SHOW  A  600  POUND  STEAM  SYSTEM.  THIS 
MEANS  THAT  THE  MAIN  STL  AS,  STEAM  TO  TURBINES,  AND  AUXILIARY  STEAM  SHOULD 
ALL  READ  ABOUT  600  POUNDS,  ALL  OTHER  BET EES  AND  GAUGES  ARE  READ  THE  SAME 
AS  ABOARD  YOUR  SHIP.  ME  ONLY  HAST  YOU  TO  LOG  YOUR  READINGS  FOR  THE 
EXAMPLE  AND  FOR  THE  FIRST  FOUR  TEST  PICTURES.  FOR  TfS  REMAINING  PICTURES 
HE  JUST  RANT  YOU  TO  REPOST  ANY  PROBLEM  THAT  YOU  FIND  WITH  ANY  OF  THE  READ¬ 
INGS.  YOU  HILL  HAVE  THO  MINUTES  TO  LOG  TEE  READINGS  FOR  EACH  OF  THE  FOUR 
TEST  PICTURES,  AND  YOU  MILL  SAVE  30  SECONDS  ID  TELL  US  WHAT’S  WRONt,  WITH 
EACH  PICTURE.  NOW,  LET'S  DO  THE  EXAMPLE  TOGETHER." 

A  sample  itesa  fro®  the  s&ssesger  wart  oh  test  is  presented  ac  Figure  3-1. 


□ 

□ 
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Bach  of  the  four-  throttle  board  pictures  consisted  of  eight  storable  I teas. 
The  exasdsees  uer-s  given  two  points  credit  fox*  eacst  nisoerieally  correct  answer  and 
one  point  cr-edit  for  each  answer  that  was  wiyfein  plus  or  sinus  five  per  ceat  of  the 
afreet  answer.  Thus,  the  aarlsaas  *v*ssiMe  score  on  this  test  was  6$.  %o  credit 
was  given  for  any  answer  outside  the  five  per  cent  ISssits. 

figure  3-2  shows  the  adainistrat ion  of  the  messenger  watch  test  during  the 
criterio,,  data  collect  in??. 


Figure  3-2.  Messenger  watch  test  situation. 


Breakir^-MaJsing  a  Flange 

The  level  of  each  recruit  in  the  followup  m  breaking-asking  a  flange  was 
measured  through  an  individually  administered  parforsattce  test.  In  this  test#  the 
examinee  was  required  to  break  and  sake  a  flangv  using  the  following  tools  and  items: 
{a,  several  gaskets*  (1)  ar,  assembled  six  inch  flange  with  input  valves,  (n)  one 
scraper,  (d)  two  combination  (boxed /open  end )  wrenches,  (e)  one  rag,  sad  (f }  a  bucket 
of  water  with  a  funnel.  Figure  3-3  shows  one  of  the  exaoinsess  taking'*  'lie  breaking- 
g a king  a  flange  test.  Scoring  was  aeccsaplisbfcd  thrown  the  parfcassanc©  checklist  format 
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Figure  3-3.  Breaking-making  a  flange  U  *t  situation. 

Twenty  s coracle  items  were  included  in  this  checklist*  Scoring  was  based  ac 
following  the  correct  procedures,  care  and  use  of  tools,  and  adherence  to  safety 
precautions.  The  directions  to  the  examinees  for  the  breaking -ssaking  a  flat®® 
perfcrsacce  test  are  shown  below: 


"THIS  IS  A  TEST  OF  YOUR  ABILITY  TO  BREAK  AND  HAKE  A  FAULTY  HASSE. 

THIS  IS  A  JOB  OFTEN  DOME  BY  KACKIFIS?  KATES.  YOUR  TASK  IS  TO  BREAK  AND 
JUKE  THIS  FLAJiGE  Ift  THE  SAKE  WAY  AS  YOU  WOULD  ABOARD  SHIP.  AFTER  YOU  ARE 
TOLD  TO  SEGIH,  YOU  WILL  HAVE  8  KIKU7IS  TO  BREAK  AND  MAKE  THE  FAULTY  FLANGE. 
YOU  WILL  BE  SCORED  OH  I.jW  CORRECTLY  PERFORM  EACH  STEP.  YOU  WILL  ONLY 
RECEIVE  CREDIT  FOR  THOSE  ITEMS  7  "  >  .  ZKISH  CORRECTLY  -  ALL  THE  MATERIALS  AM 
TOOLS  YOU  HEED  ARE  AVAILABLE  1U  F-CST  0F70U.  IF  YOU  HAVE  ANY  QUESTIONS, 
PLEASE  ASK  THEM  HOW." 


The  correct  procedure,  in  slightly  condensed  for®,  for  break ing -staking  a 
flange  includes  the  following  steps: 

1.  tightly  closing  valves  on  input  and  output  sides 
of  flange 

2.  pulling  drainplug  to  drain  systea 

3-  using  bucket  to  catch  runoff  frca  system  aid  wiping 
up  any  spillage  that  occurs 

14.  replacing  drain  plug 

S.  braking  flange  apart  using  tao  feoxsnd  wrenches 

&-  reaching  old  gasket 

?.  scraping  aid  cleaning  flange  surface 

8.  inspection  of  flange  surfaces  for  nicks,  scratches 
or  gasket  residue 

9.  replacing  correct  gasket 

10-  replacing  bolts  {hand  tighten) 

11.  tighten  belts  using  cross  over  pattern  with  appropriate 
(box-eni/open  end)  wrenches 

12.  open  both  valves  and  refill  system  with  water 

13-  inspection  of  flange  for  leakage 


Packing  a  Valve 

Ability  on  the  valve  packing  criterion  was  also  measured  rb-ough  ?n  individually 
administered  performance  test.  In  this  1  t,  the  examinee  was  required  to  pack  s  valve 
using:  (a)  a  large,  mounted  valve,  (b)  packing  material,  (c)  a  knife,  fd)  a  box  end 
or  open  end  wrench,  and  (e)  a  packing  puller-  A  photograph  of  the  tools  used  by  tie 
examinee  for  the  valve  packing  test  and  the  previous  performance  test,  break ing-reax log 
a  flange,  is  shown  in  Figure  3-9-  Figure  3-5  shows  one  of  the  test  administrators 
showing  the  available  tools  to  an  examinee.  Figure  3-6  presents  the  examiner  adainister- 
ing  the  valve  packing  test  to  an  examinee. 
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Figure  3-6.  Valve  packing  test  situation. 


We  note  (Figure  3-4)  that  the  examinees  were  given  tools  other  than  those  which 
were  needed  for  completing  the  test.  Accordingly,  the  examinee  was  required  to  select 
from  among  the  available  tools  those  which  were  most  appropriate  for  accomplishing  the 
task.  For  example,  the  exaninee  had  at  his  disposal  the  torrect  size  combination 
wrench  for  loosening  the  nut  holding  the  hand  wheel  to  the  valve.  A  crescent  wrench 
and  a  monkey  wrench-  both  of  which  can  be  adjusted  to  f.  ;  the  handwheel  nut,  were  also 
available  even  though  they  are  inappropriate  wrenches.  Credit  was  lost  if  the  im¬ 
proper  tool  was  aqployed  for  a  give  a  task. 

The  examinee  directions  for  the  valve  packing  test  are  presented  below: 

"THIS  IS  A  TEST  OF  TOUR  ABILITY  TO  PACK  A  VALVE.  THIS  JOB  IS  OFTEN 
DOME  BY  MACHINIST  HATES.  YOUR  TASK  IS  TO  PACK  THIS  VALVE  IN  THE  SAME  WAY 
AS  YOU  WOOLS  ABOARD  SHIP.  AFTER  YOU  ARE  TOLD  TO  BEGIN,  YOU  WILL  HAVE  7 
MINUTES  TO  PACK  THE  VALVE.  YOU  WILL  BE  SCORED  ON  HOW  CORRECTLY  YOU  PERFORM 
EACH  STEP.  YOU  WILL  ONLY  RECEIVE  CREDIT  I®  THOSE  PARTS  OF  THE  JOB  YOU 
FINISH.  ALL  THE  MATERIALS  AND  TOOLS  YOU  NEED  ARE  AVAILABLE  IN  FRONT  OF  YOU- 
IF  YOU  HAVE  ANY  QUESTIONS  PLEASE  ASK  THEM  HOW." 


Scoring  of  the  valve  packing  test  was  also  of  the  performance  checklist  type, 
ikenty- three  storable  items  were  included  in  this  measure.  Scoring  was  based  ro» 
following  the  correct  procedure,  care  and  use  of  tools,  and  adherence  to  sarety 
regulations.  The  correct  procedure,  in  slightly  condensed  fora,  for  packing  a  valve 
included  the  following  steps: 


1.  removing  hand  wheel  nut  with  box  end  or  open  end  wrench 
2  removing  hand  wheel 

3.  removing  packing  nut  with  open  end  wrench. 

«.  removing  packing  gland 

5.  removing  packing  with  packing  puller 

6.  inspecting  to  see  that  all  packing  is  removed 

7.  cut  packing  rings  with  knife 

8.  installing  oacking  rings  neatly  and  pushing  down  with 
gland  until  one-half  of  gland  is  showing 

9.  replacing  gland  and  packing  nut  (hand  tight) 

10.  replacing  handwheel  and  handwheel  nut  (snugged  up  but  not 
over  tightened) 

11.  tightening  with  appropriate  wrenches 


Malfunction  and  Emergency  Procedure^  (Sequential) 

Knowledge  of  correct  actions  in  common  malfunction  and  emergency  situations 
was  tested  through  an  individually  administered  test.  Each  item  in  the  test  consisted 
of  a  set  of  pictures  depicting  a  Fleet  emergency  or  malfunction  correction  sequence - 
The  task  of  the  examinee  was  to  place  the  pictures,  which  were  jresevted  in  scrambled 
order,  in  the  correct  sequence.  To  do  this  the  examinee  must  fii  »c  recognize  what 
ir.  being  represented.  Tile  examinee  directions  for-  this  test  wer  : : 

"THIS  IS  A  TEST  OF  TOUR  ABILITY  TO  ARRANGE  A  SET  OF  PICTURES, 

SHOWING  JOBS  OFTEN  DONE  BY  MACHINIST  HATES  IN  THE  RIGHT  ORDER.  WE 
WILL  GIVE  YOU  A  SERIES  OF  PICTURES.  YOU  ARE  TO  ARRANGE  EACH  SET  OF 
PICTURES  SO  THAT  THEY  HAKE  A  SENSIBLE  STORY .  YOU  WILL  BE  SCORED  OR 
ROW  WELL  YOU  PUT  THESE  PICTURES  IN  THE  CORRECT  ORDER.  YOU  WILL  BE 
ALLOWED  TWO  MINUTES. TO  ARRANGE  EACH  SET  0 T  CARDS  IN  THE  CORRECT 
ORDER— SO  THAT  THEY  TELL  A  STORY .  YOU  WILL  BE  GIVEN  EXTRA  CREDIT 
IF  YOU  CORRECTLY  FINISH  A  STORY  WITHIN  30  SECONDS .  BEFORE  WE  BEGIN 
THE  TEST  LET'S  DO  A  SAMPLE." 

The  various  problems  depicted  (in  order  of  difficulty)  in  the  picture  arrangement 
test  were:  (a)  electric  shock  (sample  item),  (b)  tire  in  comp-art&ent ,  (c)  leaky 
valve  repair,  (d)  tank  gauging,  (e)  fi~e  hose  assembly,  (f)  ruptured  pipe,  and  (g> 
spring  bearing  temperature.  A  sample  item  depicting  the  correct  sequence  for  the  fire 
in  compartment  item,  is  shown  in  Figure  3-7- 
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Scoring  for  each  item  was  based  on  the  nuabar  of  correct  pictorial  connections. 

For  example,  if  the  examinee's  picture  arrangement  was:  1,2, 3, 6, 5, 4,  he  would  receive 
two  points,  because  he  correctly  connected  picture  number  1  with  picture  number  2,  and 
picture  number  2  with  picture  number  3.  On  the  other  hand,  if  tbs  examinee's  picture 
arrangement  sequence  was:  1,2, 3, 4, 5, 6,  he  would  receive  five  points,  because  he  cor¬ 
rectly  ordered  each  pair  of  pictures.  Also,  a  time  bonus  of  one  correct  connection  per 
item  set  was  given  for  each  item  set  correctly  completed  within  30  seconds.  Thirty- 
three  possible  points  could  be  scored  in  this  test. 

Equipment  /Toole  flames  and  Use 

Equipment/tools  names  and  use  criterion  level  was  measured  through  an  individually 
administered  test  which  consisted  of  a  series  of  eight  cards.  Each  card  depicted  a 
typical  machinist  sate  work  situation  along  with  pictures  of  three  tools  which  might  be 
employed  to  complete  the  task.  For  erch  item,  the  exasdnea  was  required  to  select, 
from  among  the  three  tools  shown,  the  best  one  for  cospl&tixg  the  specific  job  depicted . 
The  examinee  received  additional  credit  if  he  was  also  able  to  name  the  tool  and  tell 
why  it  is  the  best  tool  to  use.  A  sample  item  is  shown  in  Figure  3-8. 

A  sample  examiner  proctol  for  the  items  shown  in  Figure  3-8  is  presented  below: 

a)  Show  examinee  picture  of  a  nut  with  a  socket. 

b)  Say: "THIS  IS  A  NUT  WITH  A  SOCKET." 

c)  Show  examinee  wrench  pictures  for  item  2. 

d)  Say:  "WHICH  WRENCH  WOULD  YOU  USE  TO  FREE  THE  HIT?" 

e)  Pause  15  seconds  for  his  answer. 

f)  After  a.  response  is  given  continue  by  saying:  "k hat 
IS  THE  TOOL  CALIE0?" 

g)  Pause  10  seconds  for  his  answer. 

h)  Go  on  to  question  13  if  the  first  part  of  this 
question  was  answered  correctly. 

In  some  instances  an  exes. 'nee  would  identify  the  incorrec ■  tool  for  a  particular 
situation.  If  he  was  able  to  ,«%.•«  the  tool  he  identified,  rega.  dless  of  whether  or  not 
it  was  the  right  tool  to  use,  he  received  one  point  credit  for  that  item.  Thirty- two 
possible  points  could  be  achieved  on  this  test. 


General  Alertness/Coiaaoa  Sense  (what’s  wrong) 

General  alertness/ ccroon  sense  criterion  performance  vas  «,,■  -o  measured  through  a 
pictorial,  individually  administered  test.  Each  ites  consisted  of  a  picture  of  a  typical 
machinist  mate  work  situation  in  which  a  sailor  is  shown  doing  scuething  wrong.  The  task 
of  the  examinee  was  to  detect  and  report  what  is  wrong  or  absurd  in  the  picture.  The 
examinee  directions  for  the  general  alertness/comaon  sense  test  er*  shown  on  the  follow- . 
if»g  page 


■wtfi  •w"riyw  v*™*^** 


"THIS  IS  A  TEST  OF  YOUR  ABILITY  TO  SEE  WHAT  IS  WRONG  IK  A  DRAWERS 
OR  DIAGRAM.  WE  WILL  PLACE  THE  DRAWING  OR  DIAGRAM  IK. FRONT  OF  YOU.  YOU  WILL 
BE  ALLOWED  OHE  MINUTE  TO  TELL  US  WHAT  IS  WRONG  WITH  EACH  DRAW I  IK?  OR  DIAGRAM. 
POINT  TO  THE  PROBLEM  ON  THE  DRAWING  OR  DIAGRAM  IF  YOU  ARE  HOT  ABLE  TO  TELL 
US  WHAT  THE  PROBLEM  IS.  LET’S  TRY  A  SAMPLE  PROBLEM.*’ 


Two  sample  examiner  proctcis  frost  the  general  alertness/coraion  sense  test  are 
shows!  below.  Also  shown,  in  Figure  3-3,  is  a  sample  ger«  al  alertness/ccramon  sense 
test  item. 


1.  Problem  -  Man  using  screwdriver  as  chisel. 

a) show  examinee  picture 

b) Say:  "WHAT'S  THE  MAN  IN  THE  PICTURE  DOING  WRONG?” 


2-  Problem  -  Man  using  chisel  with  a  mushroom  end. 

3>Shov  examinee  picture 

b)Say;  "WHAT’S  THE  HAN  IK  THE  PICTURE  DOING  WRONG?” 


Figure  3-R  Sample  general  alert ness/cossmon  sense  itea. 
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The  items  in  the  general  alertness/conaaon  sense  test  were  arranged  in  ascending 
order  cf  difficulty.  In  scoring  this  test,  the  examinee  was  allowed  two  points  for 
each  correct  answer,  and  one  point  credit  if  he  pointed  co  the  problem  or  gave  a  correct 
but  not  fully  insightful,  answer.  One  point  was  subtracted  from  the  examinee's  score 
for  each  incorrect  or  unanswered  question.  Since  this  test  consisted  of  seven  items, 
the  total  possible  score  attainable  by  the  examinee  was  14. 


Establishment  of  Hialraally  Acceptable  and  Desirable  Scores 

Tne  Delphi  Technique,  developed  by  the  Rand  (Heliaer,  1967;  Dal  key ,  1967  ;  Brown, 
1966;  Dalkey  1962;  Dalkey,  1969;  Martino,  1972)  Corporation,  was  used  to  establish 
''minimally  acceptable’-  and  "desirable"  performance  scores  on  each  of  the  criterion  tests. 
Tne  Delphi  Technique  is  essentially,  a  method  of  converging  the  opinions  of  a  small  group 
of  "experts."  Each  "expert"  is  asked  a  question  designed  to  elicit  -  quantitative 
estimate  or  opinion.  The  question,  in  the  present  case,  involved  the  "minimally  accept¬ 
able"  and  "desirable"  score  (defii.  *  below)  on  each  of  the  criterion  tests.  Each  judge 
assigns  his  estimate  individually  without  conferring  with  the  other  members  of  the  group. 
Jn  the  current  application,  once  the  estimates  were  assigned,  they  were  collected  and 
placed  on  a  blackboard  so  that  each  individual  could  review  his  estimate  in  the  context 
of  the  other  group  members'  estimates.  Then,  each  "expert"  was  asked  to  justify  his  own 
opinion.  Following  the  justification  procedure,  the  "experts"  were  again  asked  to  assign 
quantitative  estimates  to  the  same  question.  This  procedure  was  followed  for  as  many 
trials  as  required  for  the  group  to  arrive  at  a  convergence  of  opinion.  Four,  experi¬ 
enced  machinist  mates  from  the  instructional  staff  of  the  Damage  Control  Training  Center 
at  the  Philadelphia  Navy  Yard  served  as  judges  within  the  Delphi  procedure.* 

1 .  Minimally  Acceptable 

The  lowest  score  on  this  test  that  you  would  accept  as  indicating 
that  you  could  make  somt  use  of  this  man  as  -a  machinist  mate  striker. 

This  score  would  indica- e  that  the  man  probably  requires  considerable 
supervision. 

2.  Desirable 


The  score  on  this  test  that  you  would  expect  a  man  to  attain  >efore 
you  would  consider  him  at  a  desirable  level  of  capability.  This  score 
would  indicate  that  the  man  is  probably  able  to  accomplish  his  assigned 
tasks  with  only  minimum  supervision. 


♦One  Chief  Warrant  Officer,  one  Master  Chief  Petty  Officer,  one  Chief  Petty  Officer,  and 
one  First  Class  Machinist  Mate. 
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A  sample  judgment  for  the  malfunction  and  emergency  procedures  (seque 
picture  arrangement)  test  is  shown  in  Figure  3~10. 


bequential  Picture  Arrangement  Vest 
33  Total  Points 


Your  name  (print) 


Directions 


Please  estimate  what  you  would  consider  a  desirable  score  f^r  an  apprentice 
machinist  mate  striker  in  the  Fleet. 


Desirable  Score 


joints 


Please  estimate  what  you  would,  consider  a  minimally  acceptable  score  for  an 
apprentice  machinist  mate  striker  in  the  Fleet. 

Minimally  Acceptable  Score  points 


Figure  3-10.  Sample  judgment  form  used  in  the  Delphi  procedure. 


Trior  to  rating,  tt'.e  judges  were  giver,  a  detailed  explanation  of  the  purpose 
content,  ana  scoring  of  each  test,  Facr.  ter.  was  reviewed  item  r, y  item  in  order 
that  the  raters  could  obtain  an  estimate  of  the  content  and  difficulty.  In  most 


est,  pictures  were 

the  -arers.  In  addition,  the  total  p 
points  were  subtracted  for  incorrect 
?  junction  the  forms  were  completed  r. 


Ir.  order  to  illustrate  the  dynamic 
al,  groan  convergence  is  presented  in  Table  3 


Table  3-1 


Delphi  Convergence  cf  Opinion  for  ''Miniaally  Acceptable"  (MA) 
and  "Desirable"  (D)  Performance  on  the  Equip¬ 
ment/  Too  Is  Hasses  and  Use  Test 


Trial 


Rater 

i 

2 

3 

IT" 

RA 

TT 

Ma 

IT 

■“TBT 

1 

30 

25 

28 

22 

25 

19 

2 

24 

18 

26 

18 

26 

18 

3 

25 

19 

25 

19 

25 

19 

4 

24 

16 

27 

21 

24 

19 

X 

25.  75 

19.50 

26.50 

20.00 

25.00 

IS. 75 

In  this  sxarapie ,  raters  2  anc  regained  steadfast  in  ..heir  opinions  throughout 
ail  three  trials.  Alternatively,  rater  i  -4ecrea3 ed  both  of  his  estimates  throughout 
the  entire  process,  while  rater  4  increased  his  estimate  on  cr.Iy  the  ’Vaininaliy  ac¬ 
ceptable’’  sec-re.  The  final  average  Delphi  estimates  for  each  of  the  criterion  tests 
are  presentee  in  Table  3-2. 


Table  3-2 

Final  Delphi  Estimate  of  "Minimally  Acceptable"  and  "Desi rable" 
Criterion  Scores  on  Seven  Performance  Criteria 


Criterion 

Messenger  Watch  Recording 
Messenger  Watch  Malfunction  Detection 
Breaking -Making  a  Flange 
Packing  a  Valve 

Malfunction  and  Emergency  Procedures 
Equipment/Tools  Naaes  and  Use 
General  Alertness/Coamon  .Sense 


Minina liy 

Acceptable  Desirable 


39.75 

51.00 

4.50 

5.75 

S.75 

12.25 

9.25 

13.75 

18.25 

24.75 

18.75 

25.00 

5.50 

8.50 
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Criterion  Samples 

Fifty-four  of  the  99  recruits  originally  tested  were  available  as  followup 
subjects  for  the  present  phase  cf  the  investigation.  Twenty-nine  of  the  followup 
subjects  were  black  and  25  were  white.  The  reasons  for  the  attrition  of  the  sample 
are  presented  below.  First,  30-35  percent  of  the  sample  left  the  service.  It  seems 
as  though  easy  of  these  individuals  were  behavior  problems.  Second,  several  ships  on 
which  members  of  the  sample  were  assigned  were  deployed  during  the  followup  testing. 

This  caused  a  loss  of  another  five  or  six  subjects.  Finally,  a  snail  number-  of  subjects 
seemed  to  be  urvlocatable  or  on  sick  leave. 

Forty-nine  of  the  54  subjects  were  stationed  and  tested  in  the  Norfolk,  Virginia 
aret.*  The  remaining  five  subjects  ware  tested  at  the  Charleston,  South  Carolina  Kas¬ 
ai  Ease.  Approximately  half  of  the  subjects  were  tested  in  large  staterooms,  on  mess- 
decks  ,  or  in  libraries  aboard  ship.  The  remaining  subjects  were  tested  in  classrooms 
or  conference  rooms  which  were  located  in  close  proximity  to  the  prior  areas.  There 
was  in  all  cases  adequate  space,  ventilation,  and  lighting  for  testing  purposes. 

A  small  number  {6-8 >  cf  the  subjects  in  the  lew  aptitude  sample  were  being 
utilized  aboard  ship  in  engineering  ratings,  (e.g..  Boiler  Technician,  Engineraan,  hull 
Technician)  other  than  Machinist  Hate.  It  was  felt  that  these  ratings  were  similar 
enough  to  the  Machinist  Mate  rating  that  the  criterion  tests  could  appropriately  be 
used  for  these  individuals  as  well. 


A  trained,  black  test  administrator  administered  the  criterion  tests  to  the  black 


ejects 


re 


in  the  sample.  This  black  test  administrator  was  25  years  old  and  college 
educated.  Jr.  addition,  he  served  tferee  years  in  the  arsed  forces-  A  2C  year  ole  -bite 
psychologist  served  as  test  administrator  for  the  white  subjects  in  tbs  followup  sample. 


Testing  time  consumed  between  or.e  and  one-half  and  two  nours  per  examine  .  Fa  -  r. 
testing  session  began  with  a  short  preliminary  conversation  ir  order  to  establish 
rapport  with  the  examinee.  This  was  followed  oy  administration  of  the  sev«n  criterion 
measures  and  a  sul ject  interview.  Table  3-3  shows  the  various  number  a*,  i  types  of  ships 
by  command  on  which  the  low  aptitude  subjects  were  stationed  during  their  ‘ our  of  I -ty 
ir.  the  Navy. 


*N'ori'oik  Naval  Base,  Little  Cree'r  Amphibious  Base,  Portsmouth  Nav? :  Shipyard. 
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Coaaar.d 


Number 


Tj 


SERVLAHT 

»* 


AIRLANT 

CRUDESLAJi? 

T1 

ft 

M 

SU3LANT 

PHXSLAST 


MXH£WA5fA?£ 


2 

2 

1 

2 

1 


3 

2 


■A 

1 

o 

2 

1 

i 


CoaiiM?  Stores  Ships 
Ui lers 
Repair  Ship 
Stores,  Ships 

Fast  Cc&bat  Support  Ship 
Asauniticn  Ship 
aircraft  Carriers 
Guided  Missile  Destroyers 
"  "  Frigates 

Escort  Ship 
Destroyer 
Submarine  Tenders 
Amphibious  Transport  Docks 
Tank  Landing  Ship 
Mine  Warfare  Ship 


Control  Saropl< 


In  addition  to  the  low  aptitude  criterion  subjects,  a  sample  of  27  A  school 
graduate  machinist  mate  strikers  was  aivc  tested.  This  group  of  examinees  served  as 
a  comparison  ar.d  control  group  for-  the  purpose-  of  determining  the  average  performance 
level  of  the  typical  machinist  state  striker-  in  the  Fleet.  All  27  c-f  these  control 
subjects  were  white,  because  the  overwhelming  majority  of  A  school  graduates  are  white 
The  control  subjects  had  about  the  same  a  count  cf  Fleet  experience  as  the  low  aptitude 
sample.  As  ar:  adder  control  measure  these  control  subjects  wer-e,  in  ail  cases,  taken 
free  the  same  ships  as  the  low  aptitude  subjects.  Ail  of  the  control  subjects  were 
required  to  take  the  tests  in  exactly  the  same  manner  and  under  the  same  ccndirie-.r.  as 
the  low  attitude  sasoie. 


Supervisory  Ratings 


The  isEieoiare  supervisor  of  each  person  in  the  low  aptitude  followup  sample 
was  interviewed  in  order  to  ascertain  the  supervisor's  perception  of  the  incumbent 
machinist  Tsata  striker.  As  mentioned  earlier,  we  place  little  credence  in  the 
validity  of  supervisory  ratings,  since  they  are  subject  to  a  multitude  of  biasing 
influences.  Nevertheless,  the  supervisory  evaluations  were  collected  because  they 
have  some  inherent  face  value  as  a  reflector  of  journeyman  ability.  The  actual 
quest  .ons  ir.  the  supervisory  interview  aj'e  presenveo  in  Figure  3--1J. 
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3.  Can  yea  please  list  for  me  his  strengths  and  other  strong  points? 

U.  « 

b. 

c. 

d. 

>u  Can  yoa  please  list  his  weaknesses? 

a. 

K 

*J  • 

c. 

d. 


What  are  his  chances  of  advancing  to  a  higher  rate  (E3  or  £4)  during  the 
next  year?  ( Show  card ) 

chances 


7.  When  comparing  him  with  others,  at  a  similar  level  of  experience,  how  does 
he  perform  technically?  (Svow  card) 

a.  Better  than  most  others  at  a  similar  level. 

b.  About  the  same  as  most  others  at  a  similar  level- 

c.  _ Net  as  good  as  most  others  at  a  similar  level- 

6.  When  comparing  him  with  others,  at  the  next  highest  level,  how  does  he 
perform  technically?  ( Show  card) 

a.  _ Better  than  most  others  at  the  next  highest  level. 

b.  _  About  the  same  as  most  others  at  the  next  highest  level. 

c.  _ Not  as  good  as 'most  others  at  the  next  highest  level. 

9.  To  what  extent  does  he  meet  your  standards  of  technical  performance? 

( Show  card) 

3. _ He  exceeds  my  standards  of  technical  performance. 

b.  _ He  meets  my  standards  of  technical  performance. 

c.  _ He  dees  not  meet  my  standards  of  technical  performance. 

10.  If  you  were  given  the  opportunity  of  choosing  your  ss-bordinates  would 
you  choose  him?  ( Show  ca. d ) 

a.  _ Yes.  -definitely. 

b .  _ Yes,  If  nc  one  else  were  available. 

c- _ No,  definitely  not. 


Fi.gu’-e  3-11. 


Supervisory  interview  performance  evaluation  questions. 


Questions  1,  2,  and  S  of  the  supervisory  interview  were  concerned  with  the 
proportion  of  time  the  inciwabent  spent  performing  various  tasks  on  the  job. 
The  questions  are-sfiown  in  Figure  3-12. 


37 


Job  Task  Analysis  Interviews 

'Within  the  supervisory  interview,  each  supervisor  was  aske<i  to  estimate  the 
average  number  of  hours  per  week  that  the  individual  low  aptitude  sailor  being 
supervised  spent  on  various  machinist  sate  tasks.  The  questions  employed  ere 
presented  as  Figure  3-12.  The  purpose  in  collecting  this  information  was  to  deter¬ 
mine  whether  or  not  the  low  aptitude  sample  was  given  an  opportunity  to  learn  the 
machnist  mate  rate  in  the  Fleet.  If  a  significant  proportion  of  the  sample  was 
not  primarily  engaged  in  learning  the  rate,  then,  poor  criterion  performance  could 
be  ascribed  to  factors  other  than  low  ability.  The  task  interview  questions  are 
showT.  in  Figure  3-12.  The  individual  test  subjects  were  also  asked  ?.  similar  set 
of  task  analytic  questions.  These  test  subject  task  analytic  questions  are  pre¬ 
sented  .n  Figure  3-13. 


l. 

For  an  average  week,  can  you  tail  ae  how  many  hours  per  week 

performs  the  following  activities: 

Name  of  Subject 

a.  Packing  valves,  making  gaskets,  and  ensuring  integrity  of 
fittings 

hours 

b.  Standing  messenger  watch 

hours 

c.  Galley  or  kitchen  duty 

hours 

d.  General  clean  up 

e.  Working  with  equipment  and  machinery  (light  off 
pumps ,  check  bearing  temperatures,  souna  tanks. 

hours 

lubricate  machinery,  etc.). 

hours 

i*  Damage  control  and  standing  firewatch 

hours 

2, 

What  other  activities,  both  machinist  mate  and  non-machinist  mate 
related,  does  he  perform  each  week?  And  what  are  their  hours? 

3  •  _ 

hours 

b. 

hours 

C  * 

hours 

d. 

hours 

5.. 

On  the  average,  how  many  hours  a  week  of  on  the  job  training  and 
instruction  ••fas  he  given  during  his  first  six  months  on  the  job? 

Jiours 

Figure  3-12.  Supervisory  job  ts.sk  analytic  interview  questions. 
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SUBJECT  INTERVIEW 

How  many  hours  a  week  do  you  pack  valves,  make  gaskets,  and  break 
flanges? 


hours 


How  many  hours  a  week  do  you  stand  messenger  watch  at  the  throttle 
boar  I? 

hours 


How  many  hours  a  week  do  you  perform  galley  or  kitchen  duty? 


hours 


How  many  hours  a  week  do  you  sweep  up  decks,  mop  up  dirt  and  the 
like? 

hours 


How  many  hours  a  week  do  you.  work  with  equipment  and  machinery  (e.g., 
light  off  pumps ,  sor.id  tanks,  check  bearing  temperatures,  lubricate 
machinery,  etc.}? 

_ hours 

How  many  hours  per  week  are  you  involved  in  damage  control  activities 
or  standing  firewatch? 

hours 


How  mny  other  activities  do  you  perform  each  week  (e.g.,  fixing 
equipment,  starting  up  and  maintaining  different  kinds  of  equipment 
and  machinery,  other  non-machinist  mate  activities? 


a. 

hours 

b. 

hours 

c. 

be  ,rs 

d. 

ho.?rs 

How  aany  hours  do  you  perform  these  activities  each  week? 

Do  you  think  you  were  given  a  fair  opportunity  to  learn  the  machinist 
ante  rate  during  the  past  6  months  aboai'd  ship?(Ask  him  to  explain) 


Figure  3-13.  Test  subject  job  task  analytic  interview  questions. 


CHAPTER  I V 


RESULTS  AND  DISCUSSION 


Chapter  IV  presents  the  results  of  the  Fleet  performance  evaluation.  The 
chapter  contains  sections  or,:  (a)  criterion  analysis,  (b)  predictive  validity,  (c) 
comcarison  of  low  aptitude  and  A  school  graduate  performance ,  (d)  differential  val¬ 
idity,  (e)  supervisory  interview  analysis,  and  (f)  analysis  of  interviews  with  low 
aptitude  sample. 

Criterion  Analysis 

In  order  to  ascertain  the  relationships  that  exist  among  the  criterion 
measures,  correlation  matrices  were  developed.  These  allow  determination  of  the 
extent  of  independence  and  free  iom  from  redundancy  among  these  variables.  The  u- 
niqueness  of  the  miniature  aptitude  tests  had  been  previously  established  {Siegel  6 
Bergman.  1S72).  Table  4-i  presents  the  intercorrieations  among  the  Fleet  criterion 
measures  on  the  basic-  of  the  scores  on  these  measures  of  the  54  low  aptitude  subjects. 
Tables  '--2  and  4-3  present  the  means  and  standard  delations  for  the  predictors  and 
criterion  measures  respectively. 


Table  4-1 

Pearson  Product  Moment  Correlations  between  the  Brea k?ng -Making 
a  flange  (SMr),  Pecking  a  Valve  (FV),  Tool  knowledge  and  Usage 
(TKU) .  oecj'K-nt  jal  (fee,) ,  What’s  arong  (WW),  Meter  Reading-Mes¬ 
senger  Watch  (MR -MW) ,  and  the  Troubleshooting  Messenger  Watch 
(Tr-MK)  Ciiterion  Measures,  and  Supervisory  Ratings  (SR),  for  54 
Low  Aptitude  Machinist  Mate  Strikers 


Criterion  Measure 


PV 

TKU 

Scvj 

KW 

MR-MW 

Tr-MW 

SR 

8Mf 

.  1  i 

.  50 

.  17 

-  25 

.07 

.12 

>23 

PV 

.01 

-.li 

.10 

.  29 

.09 

-.05 

TKU 

,  U5 

.25 

.  23 

.09 

,18 

Seq. 

.10 

.04 

-.11 

-.09 

WW 

v  37 

.16 

.14 

MR -MW 

.  i5 

.30 

Tr-MW 

.G7 
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Table  4-2 

Means  and  Standard  Deviations  on  the  GCT,  ARI,  MECH,  Equipment 
Use  and  Nomenclature  (EUN),  Gasket  Cutting  (GC),  Meter  Reading 
(kfil).  Troubleshooting  (T),  Equipment  Operation  (EG),  and  Assem- 


Test 

Mean 

S.D, 

GCT 

39.20 

7.07 

ARI 

42.39 

5.17 

MECH 

42.31 

5.18 

EUN 

20.80 

2.84 

GC 

14.67 

2.66 

MR 

11.81 

3. 85 

T 

15.37 

5.12 

EO 

59.72 

6.51 

A 

23.41 

3.61 

Table  4-3 

Means  and  Standard  Deviations  on  the  Breaking- Making  a  Flange  (BMF), 
Packing  a  Valve  (PV),  Too*  Knowledge  and  Usage  (TKU) ,  Seouential 


(Seq),  What's  Wrong  fWW) ,  Meter  fteading-Messt?nger  Watch  (MR -MW) ,  and 
Troubleshooting-Messenger  Watch  (fr-MW)  criterion  Measures  and  Super¬ 
visor)-  Ratings  (SR)  for  54  Low  Aptitude  Machinist  Mate  Strikers 


Test 

Ms  an 

S.D. 

BMF 

6.41 

1.84 

PV 

9.43 

2.82 

TKU 

20.02 

4.83 

Seq 

11.61 

3. 65 

WW 

3.55 

3.25 

MR -MW 

55.91 

5.42 

Tr-MW 

13.7* 

2.02 

SR 

-.30^ 

17.68* 

•Based  on  s  summation  of  standard  scores. 
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Predictive  Validity 

Three  separate  types  of  analyses  were  performed  in  order  to  ascertain  the  rela¬ 
tionship  between  the  predictor  instruments  (miniature  aptitude  tests  and  Navy  predict¬ 
ors)  and  the  criterion  measures.  These  analyses  were  performed  primarily  to  assess  the 
predictive  power  of  the  miniature  aptitude  tests  and  secondarily  to  compare  tl.-e  predict¬ 
ive  power  of  the  miniature  aptitude  tests  with  the  predictive  power  of  the  usual  Navy 
tests. 


In  the  first  analysis,  composite  predictor  scores  were  correlated  with  a  composite 
criterion  measure  for  both  the  A  school  graduate  sample  and  the  low  aptitude  sample.  The 
specific  predictor  composite  scores  involved  were:  (1)  composite  Navy  test  scores  con¬ 
sisting  of  the  sum  of  the  scores  on  the  three  Navy  predictor  tests,  and  (2)  the  sum  of 
the  standard  scores  of  the  six  miniature  aptitude  tests.  The  composite  field  criterion 
measure  consisted  of  the  sum  of  standard  scores  of  the  Fleet  performance  tests.  These 
composite  score  correlations  were  calculated  in  order  to  satisfy  the  economic  and  logical 
aspects  inherent  to  any  validity  study,  i.e.,  to  provide  a  useful,  summary  index  of  tl,e 
predictor-criterion  relationship.  A  single  correlation  measure  as  opposed  to  a  series 
correlations  can  be  most  useful  in  some  descriptive  circumstances.  The  Pearson  p'odi 
moment  correlation  coefficient  between  the  composite  Navy  predictors  and  the  cor  ,s?- 
field  criterion  for  the  A  school  graduates  was  -.01.  This  indicates,  at  least  in  the 
context  involved,  that  the  Navy  predictors  possessed  no  predictive  relationship  with  the 
criterion  employed. 


The  product  moment  correlation  coefficients  between  the  composite  Navy  and  the 
miniature  aptitude  predictors  and  the  composite  field  criterion  for  the  low  aptitude 
sample  were  -33  and  .49  respectively.  While  there  is  no  statistically  significant 
difference  between  these  coefficients,  the  miniature  aptitude  predictors  accounted  for 
more  than  twice  as  much  predictable  criterion  variance  as  did  the  Kavy  predictors.  It 
has  bean  suggested  that  the  usual  Navy  test-composite  criterion  correlation  should  be 
corrected  for  the  restricted  range  of  low  aptitude  sample  on  the  usual  Navy  tests.  We 
do  net  believe  ;hat  a  correction  for  restriction  ir.  range  is  appropriate  here  because 
the  range  of  Navy  test  scores  in  the  low  aptitude  population  is,  in  fact,  restricted. 
The  findings  indicate  that  if  the  low*  and  high  aptitude  data  are  combined  so  as  to 
eliminate  range  restriction,  the  composite  predictor-criterion  correlation  for  the  Navy 
predictors  increase  to  :  value  of  -.53.  Tnis  grouping,  though,  leads  to  the  spurious 
conclusion  that  the  Navy  teSTS  are  equally  as  powerful  as  the  miniature  aptitude  tests 
U'=  .u3)  for  predicting  Fleet  criterion  performance  for  the  low  aptitude  sample.  This 
conclusion  is  held  to  be  spurious  on  both  logical  and  statistical  grouses. 


On  a  logical  basis,  it  would  be  incorrect  to  combine  the  high  and  low  aptitude 
groups,  because  the  subgroups  comprising  this  combination  differ  qualitatively  from  each 
other.  These  qualitative  differences  consist  of  factors  such  as  educational  opportunity, 
reading  habits,  racial  composition,  interests,  and  motivation.  In  a  sense,  combining  the 
low.  and  high  aptitude  groups  in  the  present  investigation  is  akin  to  performing  a  validity 
study  in  which  clerical  workers  and  bus  drivers  are  combined. 


*A  random  sample  of  27  low  aptitude  subjects  was  combined  with  the  sample  of  27  high 
aptitude  subjects. 
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Finally,  if  the  data  were  available  the  composite  miniature  aptitude  predictor- 
composite  criterion  correlation  could  also  be  spuriously  elevated.  We  have  already 
observed  that  the  A  school  subjects  when  considered  collectively ,  score  higher  on 
the  composite  criterion  than  the  low  aptitude  subjects.  In  addition,  it  is  exceed¬ 
ingly  likely  that  the  A  school  subjects  would  also  score  somewhat  higner  on  the 
composite  miniature  aptitude  predictors.  Unfortunately,  these  latter  data  are  not 
available.  We  estimate  though  that  this  correlation  would  increase  to  at  least  .55 
if  the  data  were  available. 

The  second  set  of  correlational  analyses  involved  the  calculation  of  the  mul¬ 
tiple  correlation  coefficient  between:  (1)  the  Navy  predictors  and  the  composite 
criterion,  and  (2)  the  miniature  aptitude  oredictors  and  the  composite  criterion. 

The  multiple  correlation  coefficient  between  the  three  Navy  predictors  and  the  com¬ 
posite  criterion  was  found  to  be  .36.  Fhe  multiple  correlation  between  the  six 
miniature  aptitude  predictors  and  the  composite  criterion  measure  was  found  to  be 
.50.  The  results  are  directionai.'v  analogous  to  those  derived  from  the  simpler  com¬ 
posite  predictor  and  composite  criterion  case.  When  only  three  of  the  miniature 
aptitude  tests  were  included  as  predictors,  the  correlation  was  still  relatively 
high--. 46.  Again,  the  miniature  aptitude  tests  account  for  twice  as  much  benavioral 
variation  as  the  usual  Navy  predictor  tests. 

In  tr,e  final  correlation  analysis,  multiple  correlation  coefficients  were  calcu¬ 
lated  between  the  two  sets' c-r  predictors  (Navy  and  miniature  aptitude)  and  each 
Fleet  criterion  measure.  These  data  analyses  allow  deeper  insight  into  the  relative 
contribution  of  various  predictors  to  the  overall  correlation  and  also  provide  a 
weighted  basis  for  prediction,  on  an  individual  criterion  basis,  cf  criterion  score 
from  predictor  scores. 


Table  4-4  shows  the  multiple  correlation  coefficients  for  the  prediction  of 
field  criterion  performance  by  the  Navy  ar.d  the  miniature  aptitude  predictors.  Table 
4-4  indicates  that  the  GCT,  ARI,  and  MECH  composite  predicted  with  statistical  signi¬ 
ficance  (p <  .05)  the  tool  knowledge  and  usage,  meter  reading-messenger  watch,  and 
what’s  wrong  criteria  for  the  54  low  aptitude  Navy  machinist  sate  strikers  while  the 
miniature  aptitude  tests  predicted  performance  (p<  .05)  on  the  supervisory  ratings, 
tool  kn »wledge  and  usage,  what's  wrong,  and  meter  reading-messenger  wa^ch  criteria. 
Here  again,  there  is  3  tendency  for  the  miniature  aptitude  tests  to  be -more  predic¬ 
tive  than  the  usual  Navy  tests.  Not  only  was  one  sore  statistically  icant 

correlation  coefficient  yielded  but  also  of  the  eight  pairs  of  coefficients  Included 
in  Table  u-4,  the  value  for  the  miniature  aptitude  tests  is  greater  than  tnat  for 
the  usual  Navy  tests  in  six  cases.  The  directional  difference,  as  tested  by  the 
"sign  test,"  is  statistically  significant  at  the  .145  level  of  confidence. 
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Table  4-4 


Multiple  Correlation  between  the  Two  Sets  of  Predictors*  and  the 
Individual  Criterion  Measure  (N=  54) 

Predictor 


Criterion 

Break ing-Makiag  a  Flange 

Packing  a  Valve 

Tool  Knowledge  and  Usage 

Sequential 

What's  Wrong 

Meter  Reading-Messenger  Watch 
Troubleshooting-Messenger  Watch 
Supervisory  Ratings 


.17 

.14 

.47** 

.33 

.39** 

*41 

.43** 

.32 


Miniature 
Aptitude 
.SO 
.22 
_  42** 
.35 
.43** 
.46*** 
.22 
.39** 


‘Only  the  best  three  miniature  aptitude  predictors  were  utilized  in  «*rder  that 
direct  coeparisons  between  the  two  types  of  predictors  could  more  readily  be  made. 
**  p<.05 

***  p<.01 


A  complete  listing  of  rhe  multiple  regression  equations  for  predicting  the 
individual  criteria  from  the  various  predictors  is  presented  in  the  appendices  of 
this  report. 

Comparison  of  Low  Aptitude  and  A  School  Graduate  Performance 

Another  question  of  concern  is  that  of  whether  cr  not  recruits  who  show  little 
promise  of  succeeding  in  the  machinist  mate  rate,  according  to  the  traditional  Navy- 
tests  (GCT,  ARI  and  MECH),  can  be  identified  by  other  predictors  (miniature  aptitude 
tests)  and,  in  fact,  be  shown  to  succeed  according  to  an  "on  the  job"  field  criterion. 
7c  this  end  t-tests  were  performed  to  ascertain  the  statistical  significance  of  the 
petfun;>ance  differences,  if  any,  between  the  36  low  aptitude  subjects  who  were  pre¬ 
dicted  to  be  successful  in  the  Fleet*  and  the  27  A  school  graduates  who  met  the  Navy 
aptitude  test  requirements.  The  mean  and  the  standard  deviation  of  che  low  aptitude 
subjects  who  were  judged  to  "pass"  the  miniature  aptitude  screen  and  the  A  school  grad¬ 
uate  samples  or  each  criterion  test  are  shown  in  Table  4-5.  According  to  Table  4-5,  the 
perforssar.ee  of  the  A  senool  graduates  was  significantly  superior  to  that  of  the  low 
aptitude  sample  (those  who  failed  on  traditional  Navy  aptitude  tests)  on  five  of  the 
sever,  field  criterion  tests. 


*A  subject  was  predicted  to  *  successful  in  the  Fleet  if  he  passed  the  troubleshooting 
test  and  any  of  two  of  the  regaining  five  tests  in  the  riniature  aptitude  battery. 
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Table  4-6  presents  the  percentage  of  lew  aptitude  subjects  predicted  to  achieve 
seme  degree  of  field  success,  according  to  miniature  aptitude  predictors,  who  attained 
or  exceeded  the  25th  and  SCfth  percentile  level  of  scores  achieved  by  the  A  school  grad¬ 
uate;,  on  the  field  criterion  tests. 

Clearly,  a  considerable  number  of  the  ©embers  of  the  low  aptitude  group,  in  ©any 
cases,  performed  on  these  criterion  tests  as  well  as,  or  even  better  than,  some  of  the 
high  aptitude  A  school  graduates. 

Table  4-7  shows  the  performance  of  the  A  school  graduate  and  the  low  aptitude 
groups  on  the  basis  of  the  percentage  of  individuals  from  each  group  whose  criterion 
scores  fell  into  each  of  the  three  criterion  referenced  categories — desii  le,  mini¬ 
mally  acceptable,  and  below  ©initially  acceptable — as  derived  from  the  Delj^ni  applica¬ 
tion.  Cn  the  whole,  ©ore  of  the  criterion  scores  of  the  A  school  graduates  fell  into 
the  upper  two  categories.  However,  the  data  of  Table  4-7  also  indicated  that:  (l) 
eight  per  cent  of  the  low  aptitude  machinist  mates  performed  at  or  above  the  minimally 
acceptable  level  cn  the  Breaking-Making  a  flange  test,  (2)  61  per  cant  performed  above 
the  minimally  acceptable  level  on  the  Packing  a  Valve  test,  (3)  79  per  cent  performed 
above  the  minimally  acceptable  level  on  the  Tool  Knowledge  and  Usage  'sst,  (4)  6  per 
cent  fell  above  the  minimally  acceptable  level  on  the  Sequential  tes,,  and  (5)  28  per 
cent  performed  above  the  minimally  acceptable  level  on  the  What’s  Wrong  test,  (6)  99 
and  100  per  cent  respectively  scored  above  the  minimally  acceptable  level  on  the  tieter 
Reading  ar.d  Trouble  Shooting  tests.  Accordingly,  it  seems  that  the  miniature  aptitude 
tests  can  glean  from  a  group  of  non  Havy  test  qualified  persons  these  individuals  who 
can  reach  fleet  performance  criteria  with  six  months  Fleet  experts. see.  This  could 
represent  a  considerable  harvest  in  terms  of  both  manpower  and  equity.  Vineberg  and 
his  colleagues  (Vineoe:*,  Taylor,  €  Cayicr,  1970;  Vineberg  l  Taylor,  1372a; Vineberg  & 
Taylor,  1372b)  have  also  found  that  a  substantial  portion  of  low  aptitude  subjects  can 
perform  adequately  on  military  jots-. 

Differential  Validity 

v.iver.  the  findings  in  Table  4-7,  t;  -  Questions  of  whether  or  not  the  miniature 
aptitude  tests  are  mere  "culture-fair"  than  the  traditional  Havy  predictors,  and,  more 
specifically,  whether  the  riniature  aptitude  te.crs  are  differentially  valid  remain  to 
be  answered. 

The  first  step  in  determining  the  answer  to  t-jestiens  was  to  determine  if 

the  composite  Havy  and  miniature  aptitude  predictor  sco»,o  *e; ->  significantly  different 
across  the  23  black  and  25  white  low  aptitude  subjects .  V -test;  ware  performed  in  order 
to  ascertain  If  the  scores  on  the  predictors  did,  indeed,  exhibit  statistically  signif¬ 
icant  differences  between  races. 

Ho  statistically  significant  differences  were  f  ound  betk.an  the  scores  fer  the  black 
and  the  white  samples  on  both  the  Naby  predictors  and  tee  miniature  aptitude  tests  (t=0.94 
and  1,06  respectively,  p>.05).  It  was  found,  however,  mat  the  performance  on  th~  composite 
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field  criterion  data  did  approach  a  statistically  significant  difference  ft=  1.98)  in 
favor  cf  the  whites.  Examined on  of  these  data  indicate  that  the  difference  is  the  result 
of  an  accuaulatioc  cf~saall  differences  in  each  of  a  number  of  the  performance  criteria.  It 
is  possible  that  differential  job  exposure  was  afforded  the  whites  as  compared  to  the  blacks 
and  that  this  differential  exposure  contributed  to  the  differences  noted.  To  provide  ad¬ 
ditional  insight  into  the  differential  validity  question,  the  relationships  among  the  compo¬ 
site  miniature  aptitude  and  Navy  predictors  and  the  composite  field  criterion  for  both  blacks 
ni  whites  were  examined.  Table  4-6  presents  the  composite  Tr'edictor-coaposite  criterion 
rod act  moment  correlation  coefficients  for  both  black  and  tne  white  low  aptitude  subjects. 
iab.e  4-3  indicates  that  both  the  Nav  and  the  miniature  aptitude  predictor-criterion  ccr- 
reAation  coefficients  failed  to  achieve  desirable  predictive  levels  for  whites  in  terms  of 
the  composite  field  criterion.  However,  both  the  miniature  aptitude  and  the  Navy  tests  pre¬ 
dicted  the  performance  of  the  black  low  aptitude  machinist  mate  strikers  on  the  composite 
cr  •'  ter  ion.  Differential  validity  is  said  to  obtain  t3oehm,  1971)  when  the  correlation  co¬ 
efficients  for  two  groups  differ  significantly  from  zero  and  from  each  other.  The  correla¬ 
tion  coefficients  presented  in  Table  4-8  were  converted  to  s  scores  and  tests  of  the  statis¬ 
tical  signif icar.ce  of  the  difference  between  the  correlation  coefficients  for  the  black  and 
*-.e  white  low  aptitude  groups  were  completed .  Hone  of  the  differences  was  found  to  approach 
.05  level  of  confidence.  Accordingly,  both  of  the  predictor  batteries  fail  tc  meet  Soelea’s 
criteria  for  differencial  validity  and  can  not  be  held  to  be  differentially  valid. 


Table  4-8 

Pearson  Product  .Moment  Correlation  Coefficients  (by  Race)  Between 
Composite  Criterion  and  Composite  Predictor  Scores 


♦Statistically  significant  at  or  below  the  .05  level  of  confidence 


Supervisory  Interview  Analysis 


The  results  of  the  supervisory  interview  analysis  indicated  that  a  large 
proportion  (actually  a  majority)  of  the  lew  aptitude  sua^ects  to  have  performed, 
at  least,  adequately  (in  the  opinion  of  their'  immediate  supervisor)  irx  their 
first  six  months  of  service.  In  many  instances,  the  subjects  performed  at  a 
betrer  than  adequate  level. 

When  the  supervisors  were  asked  to  compare  the  performance  of  their  low 
aptitude  subordinates  with  the  performance  of  others,  at  a  similar  level  of  ex- 
perierce,  29  per  cent  were  judged  as  performing  better  than  most  others  at  a 
similar  level.  Fifty  per  cent  were  said  to  be  performing  about  the  same  as  most 
others  at  a  similar  level,  and  21  per  cent  were  said  to  be  performing  poorer  than 
most  others  at  a  similar  level.  Seventy-nine  per  cent  of  the  low  aptitude  subjects, 
then,  seen  to  be  performing  at  least  at  about  the  same  criterion  level  as  most 
others  with  similar  experience.  When  asked  to  compare  •'■he  lev  aptitude  subjects 
with  others  at  the  next  highest  level-  ten  per  cent  were  judged  as  performing  better 
than  most  others  at  the  next  highest  level  while  41  per  cent  of  the  subjects  were 
judged  as  performing  the  same  as  those  at  the  next  highest  level.  Forty-sight  per 
cent  were  performing  less  well  than  those  at  the  next  higr.est  level.  Roughly  half 
of  the  low  aptitude  subjects,  then,  were  performing  as  well  as  those  at  the  next 
highest  level. 

According  to  the  supervisors,  14  rer  cent  of  the  low  aptitude  subjects  exceeded 
the  standards  of  technical  performance  of  the  supervisors.  Sixty -five  per  cent  of 
the  subjects  met  their  supervisors'  standards  while  only  19  per  cent  failed  tc  meet 
those  standards.  Eighty-cne  per  cent  of  the  low  aptitude  sample,  then,  appear  to  be 
meeting  supervisory  performance  standards - 

Finally,  62  per  cent  of  the  supervisors  reported  that  they  would  choose  their 
particular  low  aptitude  subordinate  if  given  the  opportunity  of  choosing  subordi¬ 
nates.  Twenty-nine  per  cent  of  the  time  they  indicated  they  would  choose  him  if 
no  one  else  were  available.  In  only  nine  per  cent  of  the  cases  would  they  definitely 
not  choose  him  as  a  subordinate. 

The  supervisors  of  the  low  aptitude  subjects  were  also  asked  to  list  the 
strengths  of  the  low  aptitude  sample.  Table  u-9  lists  the  number  of  times  various 
strengths  were  mentioned  and  the  proportion  of  the  low  aptitude  sample  to  whom  each 
descriptor  was  applied. 

'(any  of  the  strengths  in  Table  w-9  are  net  thes.  which  one  would  ordinarily 
assoc. ate  with  a  low  aptitude  group.  Yet,  they  do  support  the  previous  data  re¬ 
garding  the  positive  performance  of  a  number  of  the  tow  aptitude  subjects. 
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Table  4-9 


Strengths*  of  Low  Aptitude  Navy 
Machinist  Mate  Strikers 


Strength 

Nuaber 

Proportion*5 

Motivation,  Initiative 

3? 

64 

Intelligent,  Quick  Learner 

15 

26 

Obeys  Orders 

14 

24 

Does  good  job.  Adequate  job 

11 

19 

Dependable,  Reliable 

9 

16 

Gets  along  with  others 

8 

14 

Mechanically  inclined 

6 

10 

None 

5 

8 

Learned  job 

4 

7 

Good  Behavior 

3 

S 

Good  Morale 

3 

5 

♦Strengths  and  strong  points  sentior.ed  only  once  were  not  included  in  this 
table.  The  N  for  this  table  is  58  not  S4. 

**  The  supervisors  of  four  low  aptitude  subjects  were  available  even  though 
the  subjects  theaselves  were  not  available  for  cr  terion  testing-  The 
sua of  the  proportions  are  greater  than  100,  inasRuch  as  soae  supervisors 
sentioned  norc  than  one  strength  cr  string  point  per  subject. 


Table  4-iG  lists  the  weaknesses  of  the  iow  aptitude  subjects,  as  indicated  by 
their  supervisors.  Table  4-10  indicates  that  p-*or  learning  ability  and  lack  of  Esotiva 
tier*  were  the  primary  problem  areas  exhibited  by  the  law  aptitude  subjects.  Other 
problems  .mentioned  with  intermediate  frequency  are  need  for  supervision  and  discipline 


Trfcle  4*  3  0 

Weaknesses'  of  Low  Au'ftude  Navy 
Machinist  Mate  Strikers 


Weakness 


Number 


Proportion** 


Poor  learner.  Slow  learner, 
forgetful 

16 

28 

No  motivation,  No  initiative, 

Lacy 

15 

26 

Needs  supervision 

10 

17 

None 

q 

16 

Discipline 

8 

14 

Verbal  and  Written  Material 

5 

S 

Withdrawn 

2 

4 

Ur.  trainable 

2 

4 

♦Weaknesses  mentioned  only  once  were  not  included  in  this  table.  The  N 
for  this  table  is  5S  not  54..  The  supervisors  of  four  low  aptitude 
subjects  were  available  even  though  the  subjects  themselves  were  not 
available  for  criterion  testing. 

**  The  proportions  are  greater  than  100,  inasmuch  as  some  supervisors 
mentioned  more  than  one  strength  or  strong  point  per  subject. 


Low  Aptitude  Interview  Analysis 

Thirty-nine  per  cent  of  the  tested  low  aptitude  subjects  indicated  that  they 
thought  that  t’.iey  were  net  given  a  fair  opportunity  to  learn  the  machinist  sate  rate 
The  chief  reasons  for  this  opinion,  as  related  by  the  respondents,  were  that: 


1.  they  were  not  properly  taught 

2.  they  were  ignored 

3.  they  were  assigned  menial  tasks 

4.  they  were  on  galley  duty 
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Some  of  th~  ether  interview  questions  provide  additional  insight  into  the 
reasons  for  the  perception  on  the  part  of  the  low  aptitude  sample  that  thev  were 
not  given  a  fair  opportunity  to  learn  the  rate.  Forty- four  per  cent  of  th-  sample 
reported  that  they  had  served  over  three  months  on  galley  duty.  In  addition,  33 
percent  indicated  that  they  had  little  opportunity  to  pack  valves,  break  flanges, 
or  make  gaskets.  Evidently,  the  request  included  with  the  assignment  of  each  low 
aptitude  subject  to  a  ship,  that  he  be  assigned  to  work  within  the  machinist  mate 
specialty,  was  ufter  ignored.  The  additional  request,  that  the  subject  be  given 
every  opportunity  to  learn  the  tasks  typical  of  the  rate  was  als  >  ignored  in  many 
instances.  Tne  implication  to  be  drawn  from  these  data,  then,  is  that  a  large  pro¬ 
portion  of  the  lew  aptitude  sample  was  not  given  the  opportunity  to  perform  in  or 
learn  the  machinist  mate  rate.  This  fact  probably  serves  to  introduce  a  random 
factor  into  our  predictor-criterion  relationships-*  Accordingly,  the  correlations 
reported  here  are,  in  all  likelihc<'d,  underestimates  of  true  predictor-criterion 
relationships. 

From  the  Navy  supervisor  point  of  vier,  one  could  say  *hat  a  large  proportion 
of  the  low  aptitude  subjects  are  slow  learners.  I he  supervisors  may  not  have  had 
the  time  or  the  knowledge/skills  to  administer  the  special  types  of  training  re¬ 
quired  for  slow  learners-  Hence,  the  slow  learner  was  placed  in  work  situations 
requiring  little  or  no  training  (e.g.,  menial  tasks). 

Stannary  -and  Conclusions 

This  research  program  was  based  on  a  testing  concept  related  to  "culture 
fairness"  and  specifically  to  an  interpretation  which  contends  that  if  an  individ¬ 
ual  can  learn  to  perform  a  job  sample,  he  can  also  learn  to  perform  the  total  job. 
The  demonstrated  ability  to  learn  selected  aspects  of  a  job  is  employed  as  a  pre¬ 
dictor  of  ability  to  learn  to  perform  the  total  job. 

The  machinist  mate  rate  was  selected  as  a  logical  first  rate  to  dc  initial 
research  with  the  innovative  testing  concept.  The  machinist  sate  rate  involves 
performance  of  tasks  which  are  largely  nonverbal  in  nature ,  and  thus  would  allow 
an  individual  who,  for  whatever  reason,  lacks  verbal-ability  to  excel. 

A  set  of  job- related  miniature  aptitude  tests  was  constructed  and  administered 
to  Uavy  recruits  who  had  failed  on  traditional  Navy  predictors  (presumably  because 
these  tests  are  ’’loaded"  with  verbally  culture-related  materials)  and  thus  were  not 
eligible  for  machinist  mate  training  in  the  A  school.  The  job  sample  involved  no 
written  learning  materials;  iier.ee,  the  job  sample  maximally  simulated  on-the-job 
training  situation  in  which  a  foreman  instructs  a  journeyman  in  job  performance . 
After  testing,  the  recruits  were  placed  aboard  ship  .r  assignment  as  an  entry  level 
machinist  sate.  Criterion  data  were  collected  after  the  recruits  had  six  months 
of  Fleet  experience.  Out  of  a  total  of  99  low  aptitude  recruits  tested  with  the 
miniature  aptitude  job  sample  tests,  performance  criterion  data  were  available  to  be 
collected  for  a  sample  of  54.  Criterion  data  ware  also  collected  on  2"  ft  school 
graduate  machinist  mate  strikers.  The  high  aptitude  machinist  mate  strikers  were 
found,  on  the  whole  to  perform  significantly  better  cn  the  perforaance  criterion 
shipboard  tests  than  did  the  low  aptitude  machinist  sates.  However,  members  cf 
the  low  aptitude  group  did,  in  many  cases  perfora  on  these  job  sample  performance 


♦Mote  the  low  relationships  found  between  the  predictors  and  the  flange  making  and 
valve  packing  criterion  tests. 


criterion  tests  as  well  as,  or  even  better  than,  some  of  the  high  aptitude  A  school 
graduates.  The  data  also  indicated  that  the  miniature  aptitude  test  predictors  corre¬ 
lated  higher  than  the  usual  Navy  predictors  with'  the  field  criteria  tests  for  the  low 
aptitude  sample,  and  that  the  miniature  aptitude  tests  accounted  for  mere  than  twice 
the  criterion  variance  as  compared  with  the  Navy  predictors. 

The  results  of  interviews  with  the  immediate  work  supervisors  of  the  low  aptitude 
sample  indicated  that  a  large  proportion  were  performing  at  an  acceptable  level.  Specif¬ 
ically,  81  per  cent  of  the  low  aptitude  sample  were  considered  by  their  supervisors  to  be 
performing  at  an  acceptable  level.  The  areas  most  frequently  mentioned  by  the  supervisors 
a;-,  descriptive  of  the  strengths  of  the  low  aptitude  sample  were:  "motivation- initiative, " 
"quick  learner,"  and  "obeys  order..."  The  most  frequently  mentioned  weak  areas  of  the  “low 
aptitude"  sample  were:  "poor  learner ,”"no  initiative,"  and  "needs  supervision." 

The  data  did  not  support  a  conclusion  that  the  miniature  aptitude  tests  are  differ¬ 
entially  valid.  Accordingly,  their  usefulness  for  both  black  and  wnite  low  aptitude  ma¬ 
chinist  mate  applicants  seems  supported,  at  least  tentatively. 

The  primary  goal  of  the  present  study — developing  a  culture  fair  technique  that  can 
validly  predict  performance  of  low  aptitude  Navy  machinist  mate  applicants- -has,  at  least 
partially,  been  achieved.  It  is  thus  suggested  that  tests  of  this  sort  represent  a  useful 
tool  in  regard  to  man-power  and  equity.  Similar  research  with  similar  tests  but  with  other 
Navy  rates  wouid  seem  to  be  an  appropriate  pursuit  in  the  future. 

The  next  phase  of  the  present  program  involves  following  as  many  of  the  initial 
sample  for  a  second  set  of  criterion  tests.  This  second  set  of  criterion  tests  will  be 
administered  after  the  sample  has  had  about  one  year  of  Fleet  experience. 
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APPENDIX  A 


Regression  Equations  for  Predicting  Criterion  Performance 
from  Navy  Basic  Baiterv  Test  Scores 


HECK  " 
BKF  = 
PV  =r 
TKl;  = 


KW-hS  - 
HW-  TS  = 


General  Classification  Test 

Arithmetic  Test 

Mechanical  Test 

Breaking  Making  a  Flange  Test 

Packing  a  Valve  Test 

Tool  Knowledge  and  Usage  Test 

Sequential  Picture  Arrange»ent  Test 

What’s  Wrong  Test 

Messenger  Watch-Meter  Reading  Test 
Messenger  Watch-Troubleshooting  Test 
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APPENDIX  B 


Regression  Equations  for  Predicting  Criterion  Performance 
from  ti^e  Miniature  Aptitude  Scores 


KEY: 

StHJ=  Equipment  Use  and  Nomenclature  Test 
GC=  Gasket  Cutting  Test 
KR=  Meter  Reading  Test 
TR=  Troubleshooting  Test 
£0=  Equipment  Operation  Test 
ASS=  Assembly  'rast 


S7 


(it:,  TR,  1:0,  ASS  as  Predictors  (N=  S4) 


level  insufficient  for  computation  of  other  variables 
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Regression  liquations  for  Preuietion  Criterion  I’orf'ormnm-e  on  the  MU -MW  Test  Using  fcUN,  CC 

MR,  TR,  MO,  and  AS.S  as  Predictors  (N«  54) 
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